Novel nucleic acids and polypeptides

ABSTRACT

The present invention provides novel nucleic acids, novel polypeptide sequences encoded by these nucleic acids and uses thereof.

1. TECHNICAL FIELD

[0001] The present invention provides novel polynucleotides and proteins encoded by such polynucleotides, along with uses for these polynucleotides and proteins, for example in therapeutic, diagnostic and research methods.

2. BACKGROUND ART

[0002] Technology aimed at the discovery of protein factors (including e.g., cytokines, such as lymphokines, interferons, CSFs, chemokines, and interleukins) has matured rapidly over the past decade. The now routine hybridization cloning and expression cloning techniques clone novel polynucleotides “directly” in the sense that they rely on information directly related to the discovered protein (i.e., partial DNA/amino acid sequence of the protein in the case of hybridization cloning; activity of the protein in the case of expression cloning). More recent “indirect” cloning techniques such as signal sequence cloning, which isolates DNA sequences based on the presence of a now well-recognized secretory leader sequence motif, as well as various PCR-based or low stringency hybridization-based cloning techniques, have advanced the state of the art by making available large numbers of DNA/amino acid sequences for proteins that are known to have biological activity, for example, by virtue of their secreted nature in the case of leader sequence cloning, by virtue of their cell or tissue source in the case of PCR-based techniques, or by virtue of structural similarity to other genes of known biological activity.

[0003] Identified polynucleotide and polypeptide sequences have numerous applications in, for example, diagnostics, forensics, gene mapping; identification of mutations responsible for genetic disorders or other traits, to assess biodiversity, and to produce many other types of data and products dependent on DNA and amino acid sequences.

3. SUMMARY OF THE INVENTION

[0004] The compositions of the present invention include novel isolated polypeptides, novel isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules, cloned genes or degenerate variants thereof, especially naturally occurring variants such as allelic variants, antisense polynucleotide molecules, and antibodies that specifically recognize one or more epitopes present on such polypeptides, as well as hybridomas producing such antibodies.

[0005] The compositions of the present invention additionally include vectors, including expression vectors, containing the polynucleotides of the invention, cells genetically engineered to contain such polynucleotides and cells genetically engineered to express such polynucleotides.

[0006] The present invention relates to a collection or library of at least one novel nucleic acid sequences assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by hybridization (SBH), and in some cases, sequences obtained from one or more public databases. The invention relates also to the proteins encoded by such polynucleotides, along with therapeutic, diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid sequences are designated as SEQ ID NO: 1-55 and are provided in the Sequence Listing. In the nucleic acids provided in the Sequence Listing, A is adenosine; C is cytosine; G is guanosine; T is thymine; and N is any of the four bases. In the amino acids provided in the Sequence Listing, * corresponds to the stop codon.

[0007] The nucleic acid sequences of the present invention also include, nucleic acid sequences that hybridize to the complement of SEQ ID NO: 1-55 under stringent hybridization conditions; nucleic acid sequences which are allelic variants or species homologues of any of the nucleic acid sequences recited above, or nucleic acid sequences that encode a peptide comprising a specific domain or truncation of the peptides encoded by SEQ ID NO: 1-55. A polynucleotide comprising a nucleotide sequence having at least 90% identity to an identifying sequence of SEQ ID NO: 1-55 or a degenerate variant or fragment thereof. The identifying sequence can be 100 base pairs in length.

[0008] The nucleic acid sequences of the present invention also include the sequence information from the nucleic acid sequences of SEQ ID NO: 1-55. The sequence information can be a segment of any one of SEQ ID NO: 1-55 that uniquely identifies or represents the sequence information of SEQ ID NO: 1-55. One such segment can be a twenty-mer nucleic acid sequence because the probability that a twenty-mer is fully matched in the human genome is 1 in 300. In the human genome, there are three billion base pairs in one set of chromosome. Because there are 4²⁰ possible twenty-mers exist, there are 300 times more twenty-mers than there are base pairs in a set of human chromosome. Using the same analysis, the probability for a seventeen-mer to be fully matched in the human genome is approximately 1 in 5. When these segments are used in arrays for expression studies, fifteen-mer segment can be used. The probability that the fifteen-mer is fully matched in the expressed sequences is also approximately one in five because expressed sequences in one tissue comprise approximately 5% of the entire genome sequence.

[0009] Similarly, when using a sequence information for detecting a single mismatch, a segment can be a twenty-five mer. The probability that the twenty-five mer would appear in a human genome with a single mismatch is calculated by multiplying the probability for a fill match (1÷4²⁵) times the increased probability for mismatch at each nucleotide position (3×25). The probability that an eighteen mer with a single mismatch can be detected in an array for expression studies is approximately one in five. The probability that a twenty-mer with a single mismatch can be detected in a human genome is approximately one in five.

[0010] A collection as used in this application can be a collection of only one polynucleotide. The collection of sequence information or identifying information of each sequence can be provided on a nucleic acid array. In one embodiment, segments of sequence information is provided on a nucleic acid array to detect the polynucleotide that contains the segment. The array can be designed to detect full-match or mismatch to the polynucleotide that contains the segment. The collection can also be provided in a computer-readable format.

[0011] This invention also includes the reverse or direct complement of any of the nucleic acid sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and host cells or organisms transformed with these expression vectors. Nucleic acid sequences (or their reverse or direct complements) according to the invention have numerous applications in a variety of techniques known to those skilled in the art of molecular biology, such as use as hybridization probes, use as primers for PCR, use in an array, use in computer-readable media, use in sequencing full-length genes, use for chromosome and gene mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA or RNA, their chemical analogs and the like.

[0012] In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-55 or novel segments or parts of the nucleic acids of the invention are used as primers in expression assays that are well known in the art. In a particularly preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-55 or novel segments or parts of the nucleic acids provided herein are used in diagnostics for identifying expressed genes or, as well known in the art and exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for physical mapping of the human genome.

[0013] The isolated polynucleotides of the invention include, but are not limited to, a polynucleotide comprising any one of the nucleotide sequences set forth in the SEQ ID NO: 1-55; a polynucleotide comprising any of the full length protein coding sequences of the SEQ ID NO: 1-55; and a polynucleotide comprising any of the nucleotide sequences of the mature protein coding sequences of the SEQ ID NO: 1-55. The polynucleotides of the present invention also include, but are not limited to, a polynucleotide that hybridizes under stringent hybridization conditions to (a) the complement of any one of the nucleotide sequences set forth in the SEQ ID NO: 1-55; (b) a nucleotide sequence encoding any one of the amino acid sequences set forth in the Sequence Listing; a polynucleotide which is an allelic variant of any polynucleotides recited above; a polynucleotide which encodes a species homolog (e.g. orthologs) of any of the proteins recited above; or a polynucleotide that encodes a polypeptide comprising a specific domain or truncation of any of the polypeptides comprising an amino acid sequence set forth in the Sequence Listing.

[0014] The isolated polypeptides of the invention include, but are not limited to, a polypeptide comprising any of the amino acid sequences set forth in the Sequence Listing; or the corresponding full length or mature protein. Polypeptides of the invention also include polypeptides with biological activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence set forth in the SEQ ID NO: 1-55; or (b) polynucleotides that hybridize to the complement of the polynucleotides of (a) under stringent hybridization conditions. Biologically or immunologically active variants of any of the polypeptide sequences in the Sequence Listing, and “substantial equivalents” thereof (e.g., with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% amino acid sequence identity) that preferably retain biological activity are also contemplated. The polypeptides of the invention may be wholly or partially chemically synthesized but are preferably produced by recombinant means using the genetically engineered cells (e.g. host cells) of the invention.

[0015] The invention also provides compositions comprising a polypeptide of the invention. Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a hydrophilic, e.g., pharmaceutically acceptable, carrier.

[0016] The invention also provides host cells transformed or transfected with a polynucleotide of the invention.

[0017] The invention also relates to methods for producing a polypeptide of the invention comprising growing a culture of the host cells of the invention in a suitable culture medium under conditions permitting expression of the desired polypeptide, and purifying the polypeptide from the culture or from the host cells. Preferred embodiments include those in which the protein produced by such process is a mature form of the protein.

[0018] Polynucleotides according to the invention have numerous applications in a variety of techniques known to those skilled in the art of molecular biology. These techniques include use as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and gene mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample using, e.g., in situ hybridization.

[0019] In other exemplary embodiments, the polynucleotides are used in diagnostics as expressed sequence tags for identifying expressed genes or, as well known in the art and exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for physical mapping of the human genome.

[0020] The polypeptides according to the invention can be used in a variety of conventional procedures and methods that are currently applied to other proteins. For example, a polypeptide of the invention can be used to generate an antibody that specifically binds the polypeptide. Such antibodies, particularly monoclonal antibodies, are useful for detecting or quantitating the polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight markers, and as a food supplement.

[0021] Methods are also provided for preventing, treating, or ameliorating a medical condition which comprises the step of administering to a mammalian subject a therapeutically effective amount of a composition comprising a polypeptide of the present invention and a pharmaceutically acceptable carrier.

[0022] In particular, the polypeptides and polynucleotides of the invention can be utilized, for example, as part of methods for the prevention and/or treatment of disorders involving aberrant protein expression or biological activity.

[0023] The present invention further relates to methods for detecting the presence of the polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, be utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the identification of subjects exhibiting a predisposition to such conditions. The invention also provides kits comprising polynucleotide probes and/or monoclonal antibodies, and optionally quantitative standards, for carrying out methods of the invention. Furthermore, the invention provides methods for evaluating the efficacy of drugs, and monitoring the progress of patients, involved in clinical trials for the treatment of disorders as recited above.

[0024] The invention also provides methods for the identification of compounds that modulate (i.e., increase or decrease) the expression or activity of the polynucleotides and/or polypeptides of the invention. Such methods can be utilized, for example, for the identification of compounds that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are not limited to, assays for identifying compounds and other substances that interact with (e.g., bind to) the polypeptides of the invention.

[0025] The methods of the invention also provides methods for the treatment of disorders as recited herein which may involve the administration of the polynucleotides or polypeptides of the invention to individuals exhibiting symptoms or tendencies related to disorders as recited herein. In addition, the invention encompasses methods for treating diseases or disorders as recited herein comprising the step of administering compounds and other substances that modulate the overall activity of the target gene products. Compounds and other substances can effect such modulation either on the level of target gene/protein expression or target protein activity.

[0026] The polypeptides of the present invention and the polynucleotides encoding them are also useful for the same functions known to one of skill in the art as the polypeptides and polynucleotides to which they have the closest homology (set forth in Table 1). If no homology is set forth for a sequence, then the polypeptides and polynucleotides of the present invention are useful for a variety of applications, as described herein, including use in arrays for detection.

4. DETAILED DESCRIPTION OF THE INVENTION 4.1. Definitions

[0027] The term “primordial germ cells (PGCs)” refers to a small population of cells set aside from other cell lineages particularly from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to differentiate in to germ cells and other cells. PGCs are the source from which GSCs and ES cells are derived

[0028] The term “germ line stem cells (GSCs)” refers to stem cells derived from primordial stem cells that provide a steady and continuous source of germ cells for the production of gametes.

[0029] The term “embryonic stem cells (ES)” refers to a cell which can give rise to many differentiated cell types in an embryo or an adult, including the germ cells. The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells not only populate the germ line and give rise to a plurality of terminally differentiated cells which comprise the adult specialized organs, but are able to regenerate themselves.

[0030] The term “totipotent” refers to the capability of a cell to differentiate into all of the cell types of an adult organism.

[0031] The term “pluripotent” refers to the capability of a cell to differentiate into a number of differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in its differentiation capability in comparison to a totipotent cell.

[0032] The term “nucleotide sequence” refers to a heteropolymer of nucleotides or the sequence of these nucleotides. The terms “nucleic acid” and “polynucleotide” are also used interchangeably herein to refer to a heteropolymer of nucleotides. Generally, nucleic acid segments provided by this invention may be assembled from fragments of the genome and short oligonucleotide linkers, or from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic acid which is capable of being expressed in a recombinant transcriptional unit comprising regulatory elements derived from a microbial or viral operon, or a eukaryotic gene.

[0033] The terms “oligonucleotide fragment” or a “polynucleotide fragment”, “portion,” or “segment” is a sequence of nucleotide residues which is long enough to use in polymerase chain reaction (PCR) or various hybridization procedures to identify or amplify identical or related parts of mRNA or DNA molecules. A fragment or segment may uniquely identify each polynucleotide sequence of the present invention.

[0034] The terms “oligonucleotides” or “nucleic acid probes” are prepared based on the polynucleotide sequences provided in the present invention. Oligonucleotides comprise portions of such a polynucleotide sequence having at least about 15 nucleotides and usually at least about 20 nucleotides. Nucleic acid probes comprise portions of such a polynucleotide sequence having fewer nucleotides than about 6 kb, usually fewer than about 1 kb. After appropriate testing to eliminate false positives, these probes may, for example, be used to determine whether specific mRNA molecules are present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as described by Walsh et al. (Walsh, P. S. et al., 1992, PCR Methods Appl 1:241-250).

[0035] The term “probes” includes naturally occurring or recombinant or chemically synthesized single- or double-stranded nucleic acids. They may be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods well known in the art. Probes of the present invention, their preparation and/or labeling are elaborated in Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY; or Ausubel, F. M. et al., 1989, Current Protocols in Molecular Biology, John Wiley & Sons, New York N.Y., both of which are incorporated herein by reference in their entirety.

[0036] The term “stringent” is used to refer to conditions that are commonly understood in the art as stringent. Stringent conditions can include highly stringent conditions (i.e., hybridization to filter-bound DNA in 0.5 M NaHPO₄, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65° C., and washing in 0.1×SSC/0.1% SDS at 68° C.), and moderately stringent conditions (i.e., washing in 0.2×SSC/0.1% SDS at 42° C.). Other exemplary hybridization conditions are described herein in the examples.

[0037] In instances of hybridization of deoxyoligonucleotides, additional exemplary stringent hybridization conditions include washing in 6×SSC/0.05% sodium pyrophosphate at 37° C. (for 14-base oligos), 48° C. (for 17-base oligos), 55° C. (for 20-base oligos), and 60° C. (for 23-base oligos).

[0038] The term “recombinant,” when used herein to refer to a polypeptide or protein, means that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or mammalian) expression systems. “Microbial refers to recombinant polypeptides or proteins made in bacterial or fungal (e.g., yeast) expression systems. As a product, “recombinant microbial” defines a polypeptide or protein essentially free of native endogenous substances and unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in most bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; polypeptides or proteins expressed in yeast will have a glycosylation pattern in general different from those expressed in mammalian cells.

[0039] The term “recombinant expression vehicle or vector” refers to a plasmid or phage or virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression vehicle can comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural or coding sequence which is transcribed into mRNA and translated into protein, and (3) appropriate transcription initiation and termination sequences. Structural units intended for use in yeast or eukaryotic expression systems preferably include a leader sequence enabling extracellular secretion of translated protein by a host cell. Alternatively, where recombinant protein is expressed without a leader or transport sequence, it may include an amino terminal methionine residue. This residue may or may not be subsequently cleaved from the expressed recombinant protein to provide a final product.

[0040] The term “recombinant expression system means host cells which have stably integrated a recombinant transcriptional unit into chromosomal DNA or carry the recombinant transcriptional unit extrachromosomally. Recombinant expression systems as defined herein will express heterologous polypeptides or proteins upon induction of the regulatory elements linked to the DNA segment or synthetic gene to be expressed. This term also means host cells which have stably integrated a recombinant genetic element or elements having a regulatory role in gene expression, for example, promoters or enhancers. Recombinant expression systems as defined herein will express polypeptides or proteins endogenous to the cell upon induction of the regulatory elements linked to the endogenous DNA segment or gene to be expressed. The cells can be prokaryotic or eukaryotic.

[0041] The term “open reading frame,” ORF, means a series of nucleotide triplets coding for amino acids without any termination codons and is a sequence translatable into protein.

[0042] The term “expression modulating fragment,” EMF, means a series of nucleotides which modulates the expression of an operably linked ORF or another EMF.

[0043] As used herein, a sequence is said to “modulate the expression of an operably linked sequence” when the expression of the sequence is altered by the presence of the EMF. EMFs include, but are not limited to, promoters, and promoter modulating sequences (inducible elements). One class of EMFs are fragments which induce the expression or an operably linked ORF in response to a specific regulatory factor or physiological event.

[0044] As used herein, an “uptake modulating fragment,” UMF, means a series of nucleotides which mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily identified using known UMFs as a target sequence or target motif with the computer-based systems described below.

[0045] The presence and activity of a UMF can be confirmed by attaching the suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated with an appropriate host under appropriate conditions and the uptake of the marker sequence is determined. As described above, a UMF will increase the frequency of uptake of a linked marker sequence.

[0046] The term “active” refers to those forms of the polypeptide which retain the biologic and/or immunologic activities of any naturally occurring polypeptide. According to the invention, the term “biologically active” means that the polypeptide retains at least one of the biological activities of the polypeptide of the invention.

[0047] The term “naturally occurring polypeptide” refers to polypeptides produced by cells that have not been genetically engineered and specifically contemplates various polypeptides arising from post-translational modifications of the polypeptide including, but not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation.

[0048] The term “derivative” refers to polypeptides chemically modified by such techniques as ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer attachment such as pegylation (derivatization with polyethylene glycol) and insertion or substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur in human proteins.

[0049] The term “variant” (or “analog”) refers to any polypeptide differing from naturally occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e g., recombinant DNA techniques. Guidance in determining which amino acid residues may be replaced, added or deleted without abolishing activities of interest, may be found by comparing the sequence of the particular polypeptide with that of homologous peptides and minimizing the number of amino acid sequence changes made in regions of high homology (conserved regions) or by replacing amino acids with consensus sequence.

[0050] Preferably, amino acid “substitutions” are the result of replacing one amino acid with another amino acid having similar structural and/or chemical properties, i.e., conservative amino acid replacements. “Conservative” amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and negatively charged (acidic) amino acids include aspartic acid and glutamic acid. “Insertions” or “deletions” are typically in the range of about 1 to 5 amino acids. The variation allowed may be experimentally determined by systematically making insertions, deletions, or substitutions of amino acids in a polypeptide molecule using recombinant DNA techniques and assaying the resulting recombinant variants for activity.

[0051] Alternatively, where alteration of function is desired, insertions, deletions or non-conservative alterations can be engineered to produce altered polypeptides. Such alterations can, for example, alter one or more of the biological functions or biochemical characteristics of the polypeptides of the invention. For example, such alterations may change polypeptide characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover rate. Further, such alterations can be selected so as to generate polypeptides that are better suited for expression, scale up and the like in the host cells chosen for expression. For example, cysteine residues can be deleted or substituted with another amino acid residue in order to eliminate disulfide bridges.

[0052] As used herein, “substantially equivalent” can refer both to nucleotide and amino acid sequences, for example a mutant sequence, that varies from a reference sequence by one or more substitutions, deletions, or additions, the net effect of which does not result in an adverse functional dissimilarity between the reference and subject sequences. Typically, such a substantially equivalent sequence varies from one of those listed herein by no more than about 20% (i.e., the number of individual residue substitutions, additions, and/or deletions in a substantially equivalent sequence, as compared to the corresponding reference sequence, divided by the total number of residues in the substantially equivalent sequence is about 0.2 or less). Such a sequence is said to have 80% sequence identity to the listed sequence. In one embodiment, a substantially equivalent, e.g., mutant, sequence of the invention varies from a listed sequence by no more than 10% (90% sequence identity); in a variation of this embodiment, by no more than 5% (95% sequence identity); and in a further variation of this embodiment, by no more than 2% (98% sequence identity). Substantially equivalent, e.g., mutant, amino acid sequences according to the invention generally have at least 95% sequence identity with a listed amino acid sequence, whereas substantially equivalent nucleotide sequence of the invention can have lower percent sequence identities, taking into account, for example, the redundancy or degeneracy of the genetic code. For the purposes of the present invention, sequences having substantially equivalent biological activity and substantially equivalent expression characteristics are considered substantially equivalent. For the purposes of determining equivalence, truncation of the mature sequence (e.g., via a mutation which creates a spurious stop codon) should be disregarded. Sequence identity may be determined, e.g., using the Jotun Hein method.

[0053] Nucleic acid sequences encoding such substantially equivalent sequences, e.g., sequences of the recited percent identities, can routinely be isolated and identified via standard hybridization procedures well known to those of skill in the art.

[0054] Where desired, an expression vector may be designed to contain a “signal or leader sequence” which will direct the polypeptide through the membrane of a cell. Such a sequence may be naturally present on the polypeptides of the present invention or provided from heterologous protein sources by recombinant DNA techniques.

[0055] A polypeptide “fragment,” portion,” or “segment” is a stretch of amino acid residues of at least about 5 amino acids, often at least about 7 amino acids, typically at least about 9 to 13 amino acids, and, in various embodiments, at least about 17 or more amino acids. To be active, any polypeptide must have sufficient length to display biological and/or immunological activity.

[0056] Alternatively, recombinant variants encoding these same or similar polypeptides may be synthesized or selected by making use of the “redundancy” in the genetic code. Various codon substitutions, such as the silent changes which produce various restriction sites, may be introduced to optimize cloning into a plasmid or viral vector or expression in a particular prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be reflected in the polypeptide or domains of other peptides added to the polypeptide to modify the properties of any part of the polypeptide, to change characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover rate.

[0057] The term “activated” cells as used in this application are those which are engaged in extracellular or intracellular membrane trafficking, including the export of neurosecretory or enzymatic molecules as part of a normal or disease process.

[0058] The term “purified” as used herein denotes that the indicated nucleic acid or polypeptide is present in the substantial absence of other biological macromolecules, e.g., polynucleotides, polypeptides, proteins, and the like. In one embodiment, the polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more preferably at least 99.8% by weight, of the indicated biological macromolecules present (but water, buffers, and other small molecules, especially molecules having a molecular weight of less than 1000 daltons, can be present).

[0059] The term “isolated” as used herein refers to a nucleic acid or polypeptide separated from at least one other component (e.g., nucleic acid or polypeptide) present with the nucleic acid or polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a solution of the same. The terms “isolated” and “purified” do not encompass nucleic acids or polypeptides present in their natural source.

[0060] The term “infection” refers to the introduction of nucleic acids into a suitable host cell by use of a virus or viral vector.

[0061] The term “transformation” means introducing DNA into a suitable host cell so that the DNA is replicable, either as an extrachromosomal element, or by chromosomal integration.

[0062] The term “transfection” refers to the taking up of an expression vector by a suitable host cell, whether or not any coding sequences are in fact expressed.

[0063] The term “intermediate fragment” means a nucleic acid between 5 and 1000 bases in length, and preferably between 10 and 40 bp in length.

[0064] The term “secreted” includes a protein that is transported across or through a membrane, including transport as a result of signal sequences in its amino acid sequence when it is expressed in a suitable host cell. “Secreted” proteins include without limitation proteins secreted wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell in which they are expressed. “Secreted” proteins also include without limitation proteins which are transported across the membrane of the endoplasmic reticulum. “Secreted” proteins are also intended to include proteins containing non-typical signal sequences (e.g. Interleukin-1 Beta, see Krasney, P. A. and Young, P. R. (1992) Cytokine 4(2):134-143) and factors released from damaged cells (e.g. Interleukin-1 Receptor Antagonist, see Arend, W. P. et. al. (1998) Annu. Rev. Immunol. 16:27-55)

[0065] Each of the above terms is meant to encompass all that is described for each, unless the context dictates otherwise.

4.2 Nucleic Acids and Polypeptides of the Invention

[0066] Nucleotide and amino acid sequences of the invention are set forth in the Sequence Listing. Fragments of the polypeptides or proteins of the present invention which are capable of exhibiting biological activity are also encompassed by the present invention. Fragments of the polypeptide or protein may be in linear form or they may be cyclized using known methods, for example, as described in H. U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. McDowell, et al., J. Amer. Chem. Soc. 114, 9245-9253 (1992), both of which are incorporated herein by reference. Such fragments may be fused to carrier molecules such as immunoglobulins for many purposes, including increasing the valency of protein binding sites. For example, fragments of the polypeptide or protein may be fused through “linker” sequences to the Fc portion of an immunoglobulin. For a bivalent form of the protein, such a fusion could be to the Fc portion of an IgG molecule. Other immunoglobulin isotypes may also be used to generate such fusions. For example, a protein-IgM fusion would generate a decavalent form of the protein of the invention.

[0067] The present invention also provides both full-length and mature forms (for example, without a signal sequence or precursor sequence) of the disclosed polypeptides or proteins. The protein coding sequence is identified in the sequence listing by translation of the disclosed nucleotide sequences. The mature form of such protein may be obtained by expression of a full-length polynucleotide in a suitable mammalian cell or other host cell. The sequence of the mature form of the protein is also determinable from the amino acid sequence of the full-length form. Where polypeptides or proteins of the present invention are membrane bound, soluble forms of the polypeptides or proteins are also provided. In such forms, part or all of the regions causing the polypeptides or proteins to be membrane bound are deleted so that the proteins are fully secreted from the cell in which it is expressed.

[0068] The polynucleotides of the invention include naturally occurring or wholly or partially synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. SEQ ID NO: 1-55 may include all of the coding region of the cDNA or may represent a portion of the coding region of the cDNA. Further 5′ and 3′ sequence can be obtained using methods known in the art. For example, full length cDNA or genomic DNA that corresponds to any of the polynucleotides designated as SEQ ID NO: 1-55 can be obtained by screening appropriate cDNA or genomic DNA libraries under suitable hybridization conditions using any of the polynucleotides designated as SEQ ID NO: 1-55 or a portion thereof as a probe. Alternatively, the polynucleotides of the SEQ ID NO: 1-55 may be used as the basis for suitable primer(s) that allow identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries.

[0069] The nucleic acid sequences of the invention, designated as SEQ ID NO: 1-55 can be assembled ESTs and sequences (including cDNA and genomic sequences) obtained from one or more public databases, such as dbEST, gbpri, and UniGene. The sequences falling within the scope of the present invention are not limited to these specific sequences, but also include allelic and species variations thereof. Allelic and species variations can be routinely determined by comparing the sequence provided in SEQ ID NO: 1-55, a representative fragment thereof, or a nucleotide sequence at least 90% identical, preferably 99.9% identical, to SEQ ID NO: 1-55 with a sequence from another isolate of the same species. Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules coding for the same amino acid sequences as do the specific ORFs disclosed herein. In other words, in the coding region of an ORF, substitution of one codon for another which encodes the same amino acid is expressly contemplated.

[0070] The nucleic acids of the present invention, designated as SEQ ID NO: 1-55 can be assembled using an EST sequence as a seed. The EST sequence can be extended using programs or algorithms known in the art. Preferably, a recursive algorithm is used to extend the seed EST into an extended assemblage, by pulling additional sequences from different databases (e.g., Hyseq's database containing EST sequences, dbEST version 114, gb pri 114, and UniGene version 101) that belong to this assemblage. The algorithm terminates when there are no additional sequences from the databases that will extend the assemblage. Further,.the inclusion of component sequences into the assemblage is preferably based on a BLASTN hit to the extending assemblage with BLAST score greater than 300 and percent identity greater than 95%. BLAST, which stands for Basic Local Alignment Search Tool, is used to search for local sequence alignments (Altschul, S. F., J. Mol. Evol. 36: 290-300 (1993) and Altschul, S. F. et al., J. Mol. Biol., 215: 403-10 (1990)). BLAST produces alignments of both nucleotide and amino acid sequences to determine sequence similarity. Because of the local nature of the alignments, BLAST is especially useful in determining exact matches. The sequences can then be reviewed and edited.

[0071] These EST sequences can provide identifying sequence information, representative fragment or segment information, or novel segment information for SEQ ID NO: 1-55.

[0072] The nearest neighbor result for the nucleic acids can be obtained by searching a database using an algorithm or a program. Preferably, a FASTA version 3 search against Genpept, using Fastxy algorithm. The nearest neighbor result shows the closest homologue from Genpept (and contains the translated amino acid sequences for which the nucleic acid sequence encodes).

[0073] The present invention also provides genes corresponding to the cDNA sequences disclosed herein. The corresponding genes can be isolated in accordance with known methods using the sequence information disclosed herein. Such methods include the preparation of probes or primers from the disclosed sequence information for identification and/or amplification of genes in appropriate genomic libraries or other sources of genomic materials.

[0074] Species homologs (or orthologs) of the disclosed polynucleotides and polypeptides or proteins are also provided by the present invention. Species homologs may be isolated and identified by making suitable probes or primers from the sequences provided herein and screening a suitable nucleic acid source from the desired species.

[0075] The invention also encompasses allelic variants of the disclosed polynucleotides or polypeptides; that is, naturally-occurring alternative forms of the isolated polynucleotide which also encode proteins which are identical, homologous or related to that encoded by the polynucleotides.

4.3 Nucleic Acids of the Invention

[0076] The isolated polynucleotides of the invention include, but are not limited to, a polynucleotide encoding a polypeptide comprising any of the amino acid sequences listed in the Sequence Listing; or the mature protein portion thereof. A preferred nucleic acid sequence is set forth as any of the polynucleotides of the SEQ ID NO: 1-55.

[0077] The isolated polynucleotides of the invention further include, but are not limited to a polynucleotide comprising any of the nucleotide sequence designated as SEQ ID NO: 1-55; a polynucleotide comprising the full length protein coding sequence of the polynucleotides designated as SEQ ID NO: 1-55; and a polynucleotide comprising the nucleotide sequence encoding the mature protein coding sequence of the polynucleotides designated as SEQ ID NO: 1-55. The polynucleotides of the present invention also include, but are not limited to, a polynucleotide that preferably has biological activity and that hybridizes under stringent conditions (a) to the complement of any of the nucleotides sequences designated as SEQ ID NO: 1-55 (b) to a polynucleotide encoding the polypeptide of any of the amino acid sequences listed in the Sequence Listing; a polynucleotide which is an allelic variant of any polynucleotide recited above; a polynucleotide which encodes a species homolog of any of the polypeptides or proteins recited above; or a polynucleotide that encodes a polypeptide comprising a specific domain or truncation of any of the polypeptides listed in the Sequence Listing. Domains of interest may depend on the nature of the encoded polypeptide; e.g., domains in receptor-like polypeptides include ligand-binding, extracellular, transmembrane, or cytoplasmic domains, or combinations thereof; domains in immunoglobulin-like proteins include the variable immunoglobulin-like domains; domains in enzyme-like polypeptides include catalytic and substrate binding domains; and domains in ligand polypeptides include receptor-binding domains.

[0078] Polynucleotides encoding preferred polypeptide truncations of the invention can be used to generate polynucleotides encoding chimeric or fusion proteins comprising one or more domains of the invention and heterologous protein sequences.

[0079] The polynucleotides of the invention additionally include the complement of any of the polynucleotides recited above.

[0080] The polynucleotides of the invention also provide polynucleotides including nucleotide sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides according to the invention can have, e.g., at least about 65%, at least about 70%, at least about 75%, at least about 80%, more typically at least about 90%, and even more typically at least about 95%, sequence identity to a polynucleotide recited above. The invention also provides the complement of such polynucleotides. The polynucleotide can be DNA (genomic, cDNA, amplified, or synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known to those of skill in the art and can include, for example, methods for determining hybridization conditions which can routinely isolate polynucleotides of the desired sequence identities.

[0081] A polynucleotide according to the invention can be joined to any of a variety of other nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). Useful nucleotide sequences for joining to polynucleotides include an assortment of vectors, e.g., plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the art. Accordingly, the invention also provides a vector including a polynucleotide of the invention and a host cell containing the polynucleotide. In general, the vector contains an origin of replication functional in at least one organism, convenient restriction endonuclease sites, and a selectable marker for the host cell. Vectors according to the invention include expression vectors, replication vectors, probe generation vectors, and sequencing vectors. A host cell according to the invention can be a prokaryotic or eukaryotic cell and can be a unicellular organism or part of a multicellular organism.

[0082] The sequences falling within the scope of the present invention are not limited to the specific sequences herein described, but also include allelic variations thereof. Allelic variations can be routinely determined by comparing the nucleotide sequences designated as SEQ ID NO: 1-55, a representative fragment thereof, or a nucleotide sequence at least 99.9% identical to any of the nucleotide sequences of SEQ ID NO: 1-55 with a sequence from another isolate of the same species. To accommodate codon variability, the invention includes nucleic acid molecules coding for the same amino acid sequences as do the specific ORFs disclosed herein. In other words, in the coding region of an ORF, substitution of one codon for another which encodes the same amino acid is expressly contemplated. Any specific sequence disclosed herein can be readily screened for errors by resequencing a particular fragment, such as an ORF, in both directions (i.e., sequence both strands).

[0083] The present invention further provides recombinant constructs comprising a nucleic acid having any of the nucleotide sequences designated as SEQ ID NO: 1-55 or a fragment thereof or any other polynucleotides of the invention. In one embodiment, the recombinant constructs of the present invention comprise a vector, such as a plasmid or viral vector, into which a nucleic acid having any of the nucleotide sequences of SEQ ID NO: 1-55 or a fragment thereof is inserted, in a forward or reverse orientation. In the case of a vector comprising one of the ORFs of the present invention, the vector may further comprise regulatory sequences, including for example, a promoter, operably linked to the ORF. For vectors comprising the EMFs and UMFs of the present invention, the vector may further comprise a marker sequence or heterologous ORF operably linked to the EMF or UMF. Large numbers of suitable vectors and promoters are known to those of skill in the art and are commercially available for generating the recombinant constructs of the present invention. The following vectors are provided by way of example. Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNH16a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia).

[0084] The isolated polynucleotide of the invention may be operably linked to an expression control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al., Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinantly. Many suitable expression control sequences are known in the art. General methods of expressing recombinant proteins are also known and are exemplified in R. Kaufman, Methods in Enzymology 185, 537-566 (1990). As defined herein “operably linked” means that the isolated polynucleotide of the invention and an expression control sequence are situated within a vector or cell in such a way that the protein is expressed by a host cell which has been transformed (transfected) with the ligated polynucleotide/expression control sequence.

[0085] Promoter regions can be selected from any desired gene using CAT (chloramphenicol transferase) vectors or other vectors with selectable markers. Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters include lacI, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the level of ordinary skill in the art. Generally, recombinant expression vectors will include origins of replication and selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct transcription of a downstream structural sequence. Such promoters can be derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), a-factor, acid phosphatase, or heat shock proteins, among others. The heterologous structural sequence is assembled in appropriate phase with translation initiation and termination sequences, and preferably, a leader sequence capable of directing secretion of translated protein into the periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a fusion protein including an amino terminal identification peptide imparting desired characteristics, e.g., stabilization or simplified purification of expressed recombinant product. Useful expression vectors for bacterial use are constructed by inserting a structural DNA sequence encoding a desired protein together with suitable translation initiation and termination signals in operable reading phase with a functional promoter. The vector will comprise one or more phenotypic selectable markers and an origin of replication to ensure maintenance of the vector and to, if desirable, provide amplification within the host. Suitable prokaryotic hosts for transformation include E. coli, Bacillus subtilis, Salmonella typhimurium and various species within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may also be employed as a matter of choice.

[0086] As a representative but non-limiting example, useful expression vectors for bacterial use can comprise a selectable marker and bacterial origin of replication derived from commercially available plasmids comprising genetic elements of the well known cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, Wis., USA). These pBR322 “backbone” sections are combined with an appropriate promoter and the structural sequence to be expressed. Following transformation of a suitable host strain and growth of the host strain to an appropriate cell density, the selected promoter is induced or derepressed by appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an additional period. Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained for further purification.

[0087] Included within the scope of the nucleic acid sequences of the invention are nucleic acid sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences of SEQ ID NO: 1-55, or complements thereof, which fragment is greater than about 10 bp, preferably 20 to 50 bp, and even greater than 100 bp, greater than 300 bp, or greater than 500 bp. Fragments of, e.g. 15, 16, or 20 bp or more that are selective for (i.e. specifically hybridize to any one of the polynucleotides of the invention) are contemplated. Probes capable of specifically hybridizing to a polynucleotide can differentiate polynucleotide sequences of the invention from other polynucleotide sequences in the same family of genes or can differentiate human genes from genes of other species, and are preferably based on unique nucleotide sequences.

[0088] In accordance with the invention, polynucleotide sequences comprising the mature protein coding sequences corresponding to SEQ ID NO: 1-55, or functional equivalents thereof, may be used to generate recombinant DNA molecules that direct the expression of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. Also included are the cDNA inserts of any of the clones identified herein.

[0089] The nucleic acid sequences of the invention are further directed to sequences which encode variants of the described nucleic acids. These amino acid sequence variants may be prepared by methods known in the art by introducing appropriate nucleotide changes into a native or variant polynucleotide. There are two variables in the construction of amino acid sequence variants: the location of the mutation and the nature of the mutation. Nucleic acids encoding the amino acid sequence variants are preferably constructed by mutating the polynucleotide to encode an amino acid sequence that does not occur in nature. These nucleic acid alterations can be made at sites that differ in the nucleic acids from different species (variable positions) or in highly conserved regions (constant regions). Sites at such locations will typically be modified in series, e.g., by substituting first with conservative choices (e.g., hydrophobic amino acid to a different hydrophobic amino acid) and then with more distant choices (e.g., hydrophobic amino acid to a charged amino acid), and then deletions or insertions may be made at the target site. Amino acid sequence deletions generally range from about 1 to 30 residues, preferably about 1 to 10 residues, and are typically contiguous. Amino acid insertions include amino- and/or carboxyl-terminal fusions ranging in length from one to one hundred or more residues, as well as intrasequence insertions of single or multiple amino acid residues. Intrasequence insertions may range generally from about 1 to 10 amino residues, preferably from 1 to 5 residues. Examples of terminal insertions include the heterologous signal sequences necessary for secretion or for intracellular targeting in different host cells and sequences such as FLAG or poly-histidine sequences useful for purifying the expressed protein.

[0090] In a preferred method, polynucleotides encoding the novel amino acid sequences are changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter a polynucleotide to encode the desired amino acid variant, as well as a sufficient adjacent nucleotides on both sides of the changed amino acid to form a stable duplex on either side of the site of being changed. In general, the techniques of site-directed mutagenesis are well known to those of skill in the art and this technique is exemplified by publications such as, Edelman et al., DNA 2:183 (1983). A versatile and efficient method for producing site-specific changes in a polynucleotide sequence was published by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 (1982). PCR may also be used to create amino acid sequence variants of the novel nucleic acids. When small amounts of template DNA are used as starting material, primer(s) that differs slightly in sequence from the corresponding region in the template DNA can generate the desired amino acid variant. PCR amplification results in a population of product DNA fragments that differ from the polynucleotide template encoding the polypeptide at the position specified by the primer. The product DNA fragments replace the corresponding region in the plasmid and this gives a polynucleotide encoding the desired amino acid variant.

[0091] A further technique for generating amino acid variants is the cassette mutagenesis technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques well known in the art, such as, for example, the techniques in Sambrook et al., supra, and Current Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent amino acid sequence may be used in the practice of the invention for the cloning and expression of these novel nucleic acids. Such DNA sequences include those which are capable of hybridizing to the appropriate novel nucleic acid sequence under stringent conditions.

[0092] Polynucleotides of the invention can also be used to induce immune responses. For example, as described in Fan et al., Nat. Biotech. 17:870-872 (1999), incorporated herein by reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies against the encoded polypeptide following topical administration of naked plasmid DNA or following injection, and preferably intramuscular injection of the DNA. The nucleic acid sequences are preferably inserted in a recombinant expression vector and may be in the form of naked DNA.

4.4. Hosts

[0093] The present invention further provides host cells genetically engineered to contain the polynucleotides of the invention. For example, such host cells may contain nucleic acids of the invention introduced into the host cell using known transformation, transfection or infection methods. The present invention still further provides host cells genetically engineered to express the polynucleotides of the invention, wherein such polynucleotides are in operative association with a regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in the cell.

[0094] Knowledge of DNA sequences provided by the invention allows for modification of cells to permit, or increase, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous recombination) to provide increased polypeptide expression by replacing, in whole or in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells express the protein at higher levels. The heterologous promoter is inserted in such a manner that it is operatively linked to the desired protein encoding sequences. See, for example, PCT International Publication No. WO 94/12650, PCT International Publication No. WO 92/20808, and PCT International Publication No. WO 91/09955. It is also contemplated that, in addition to heterologous promoter DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be inserted along with the heterologous promoter DNA. If linked to the desired protein coding sequence, amplification of the marker DNA by standard selection methods results in co-amplification of the desired protein coding sequences in the cells.

[0095] The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. Introduction of the recombinant construct into the host cell can be effected by calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation (Davis, L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one of the polynucleotides of the invention, can be used in conventional manners to produce the gene product encoded by the isolated fragment (in the case of an ORF) or can be used to produce a heterologous protein under the control of the EMF.

[0096] Any host/vector system can be used to express one or more of the ORFs of the present invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, Cv-1 cell, COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. coli and B. subtilis. The most preferred cells are those which do not normally express the particular polypeptide or protein or which expresses the polypeptide or protein at low natural level. Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate promoters. Cell-free translation systems can also be employed to produce such proteins using RNAs derived from the DNA constructs of the present invention. Appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et al., in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y. (1989), the disclosure of which is hereby incorporated by reference.

[0097] Various mammalian cell culture systems can also be employed to express recombinant protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman, Cell 23:175 (1981), and other cell lines capable of expressing a compatible vector, for example, the C127, 3T3, CHO, HeLa and BHK cell lines. Mammalian expression vectors will comprise an origin of replication, a suitable promoter and also any necessary ribosome binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional termination sequences, and 5′ flanking nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example, SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide the required nontranscribed genetic elements. Recombinant polypeptides and proteins produced in bacterial culture are usually isolated by initial extraction from cell pellets, followed by one or more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein refolding steps can be used, as necessary, in completing configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can be employed for final purification steps. Microbial cells employed in expression of proteins can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents.

[0098] A number of types of cells may act as suitable host cells for expression of the polypeptide or protein. Mammalian host cells include, for example, monkey COS cells, Chinese Hamster Ovary (CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell strains derived from in vitro culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells.

[0099] Alternatively, it may be possible to produce the polypeptide or protein in lower eukaryotes such as yeast, insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it may be necessary to modify the protein produced therein, for example by phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional protein. Such covalent attachments may be accomplished using known chemical or enzymatic methods.

[0100] In another embodiment of the present invention, cells and tissues may be engineered to express an endogenous gene comprising the polynucleotides of the invention under the control of inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may be replaced by homologous recombination. As described herein, gene targeting can be used to replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene or a novel regulatory sequence synthesized by genetic engineering methods. Such regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations of said sequences. Alternatively, sequences which affect the structure or stability of the RNA or polypeptide or protein produced may be replaced, removed, added, or otherwise modified by targeting. These sequence include polyadenylation signals, mRNA stability elements, splice sites, leader sequences for enhancing or modifying transport or secretion properties of the polypeptide or protein, or other sequences which alter or improve the function or stability of polypeptide or protein or RNA molecules.

[0101] The targeting event may be a simple insertion of the regulatory sequence, placing the gene under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the targeting event may replace an existing element; for example, a tissue-specific enhancer can be replaced by an enhancer that has broader or different cell-type specificity than the naturally occurring elements. Here, the naturally occurring sequences are deleted and new sequences are added. In all cases, the identification of the targeting event may be facilitated by the use of one or more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection of cells in which the exogenous DNA has integrated into the host cell genome. The identification of the targeting event may also be facilitated by the use of one or more marker genes exhibiting the property of negative selection, such that the negatively selectable marker is linked to the exogenous DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and such that a correct homologous recombination event with sequences in the host cell genome does not result in the stable integration of the negatively selectable marker. Markers useful for this purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) gene.

[0102] The gene targeting or gene activation techniques which can be used in accordance with this aspect of the invention are more particularly described in U.S. Pat. No. 5,272,071 to Chappel; U.S. Pat. No. 5,578,461 to Sherwin et al.; International Application No. PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No. PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference herein in its entirety.

4.5. Polypeptides of the Invention

[0103] The isolated polypeptides of the invention include, but are not limited to, a polypeptide comprising: any one of the amino acid sequences set forth in the Sequence Listing; or an amino acid sequence encoded by any one of the nucleotide sequences designated as SEQ ID NO: 1-55; or the corresponding fall length or mature protein. Polypeptides of the invention also include polypeptides preferably with biological or immunological activity that are encoded by: (a) a polynucleotide having any one of the nucleotide sequences designated as SEQ ID NO: 1-55; or (b) polynucleotides encoding any of the amino acid sequences set forth in the Sequence Listing; or (c) polynucleotides that hybridize to the complement of the polynucleotides of either (a) or (b) under stringent hybridization conditions. The invention also provides biologically active or immunologically active variants of any of the polypeptide amino acid sequences set forth in the Sequence Listing, or the corresponding full length or mature protein; and “substantial equivalents” thereof (e.g., with at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, typically at least about 95%, more typically at least about 98%, or most typically at least about 99% amino acid identity) that retain biological activity. Polypeptides encoded by allelic variants may have a similar, increased, or decreased activity compared to polypeptides comprising any one of the amino acid sequences set forth in the Sequence Listing.

[0104] Polypeptide compositions of the present invention may further comprise an acceptable carrier, such as a hydrophilic, e.g., pharmaceutically acceptable, carrier.

[0105] The invention also relates to methods for producing a polypeptide comprising growing a culture of host cells of the invention in a suitable culture medium, and purifying the polypeptide or protein from the cells or the culture in which the cells are grown. For example, the methods of the invention include a process for producing a polypeptide in which a host cell containing a suitable expression vector that includes a polynucleotide of the invention is cultured under conditions that allow expression of the encoded polypeptide. The polypeptide can be recovered from the culture, conveniently from the culture medium, or from a lysate prepared from the host cells and further purified. Preferred embodiments include those in which the protein produced by such process is a full length or mature form of the protein.

[0106] The present invention further provides isolated polypeptides encoded by the nucleic acid fragments of the present invention or by degenerate variants of the nucleic acid fragments of the present invention. By “degenerate variant” is intended nucleotide fragments which differ from a nucleic acid fragment of the present invention (e.g., an ORF) by nucleotide sequence but, due to the degeneracy of the genetic code, encode an identical polypeptide sequence. Preferred nucleic acid fragments of the present invention are the ORFs that encode proteins. A variety of methodologies known in the art can be utilized to obtain any one of the isolated polypeptides or proteins of the present invention. At the simplest level, the amino acid sequence can be synthesized using commercially available peptide synthesizers. This technique is particularly useful in producing small peptides and fragments of larger polypeptides. Fragments are useful, for example, in generating antibodies against the native polypeptide. In an alternative method, the polypeptide or protein is purified from bacterial cells which naturally produce the polypeptide or protein. One skilled in the art can readily follow known methods for isolating polypeptides and proteins in order to obtain one of the isolated polypeptides or proteins of the present invention. These include, but are not limited to, immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, and immuno-affinity chromatography. See, e.g., Scopes, Protein Purification: Principles and Practice, Springer-Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory Manual; Ausubel et al., Current Protocols in Molecular Biology. Polypeptide fragments that retain biological/immunological activity include fragments encoding greater than about 100 amino acids, or greater than about 200 amino acids, and fragments that encode specific protein domains.

[0107] The polypeptides or proteins of the present invention can alternatively be purified from cells which have been altered to express the desired polypeptide or protein. As used herein, a cell is said to be altered to express a desired polypeptide or protein when the cell, through genetic manipulation, is made to produce a polypeptide or protein which it normally does not produce or which the cell normally produces at a lower level. One skilled in the art can readily adapt procedures for introducing and expressing either recombinant or synthetic sequences into eukaryotic or prokaryotic cells in order to generate a cell which produces one of the polypeptides of the present invention.

[0108] The polypeptide or protein of the invention may also be expressed as a product of transgenic animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized by somatic or germ cells containing a nucleotide sequence encoding the polypeptide.

[0109] The polypeptide or protein may also be produced by known conventional chemical synthesis. Methods for constructing the proteins of the present invention by synthetic means are known to those skilled in the art. The synthetically-constructed protein sequences, by virtue of sharing primary, secondary or tertiary structural and/or conformational characteristics with proteins may possess biological properties in common therewith, including protein activity. Thus, they may be employed as biologically active or immunological substitutes for natural, purified proteins in screening of therapeutic compounds and in immunological processes for the development of antibodies.

[0110] The polypeptides or proteins provided herein also include proteins characterized by amino acid sequences similar to those of purified proteins but into which modification are naturally provided or deliberately engineered. For example, modifications in the peptide or DNA sequences can be made by those skilled in the art using known techniques. Modifications of interest in the polypeptide or protein sequences may include the alteration, substitution, replacement, insertion or deletion of a selected amino acid residue in the coding sequence. For example, one or more of the cysteine residues may be deleted or replaced with another amino acid to alter the conformation of the molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are well known to those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584). Preferably, such alteration, substitution, replacement, insertion or deletion retains the desired activity of the protein. Regions of the protein that are important for the protein function can be determined by various methods known in the art including the alanine-scanning method which involved systematic substitution of single or strings of amino acids with alanine, followed by testing the resulting alanine-containing variant for biological activity. This type of analysis determines the importance of the substituted amino acid(s) in biological activity.

[0111] Other fragments and derivatives of the sequences of polypeptides or proteins which would be expected to retain protein activity in whole or in part and are useful for screening or other immunological methodologies may also be easily made by those skilled in the art given the disclosures herein. Such modifications are encompassed by the present invention.

[0112] The polypeptide or protein may also be produced by operably linking the isolated polynucleotide of the invention to suitable control sequences in one or more insect expression vectors, and employing an insect expression system. Materials and methods for baculovirus/insect cell expression systems are commercially available in kit form from, e.g., Invitrogen, San Diego, Calif., U.S.A. (the MaxBat™ kit), and such methods are well known in the art, as described in Summers and Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by reference. As used herein, an insect cell capable of expressing a polynucleotide of the present invention is “transformed.”

[0113] The polypeptide or protein of the invention may be prepared by culturing transformed host cells under culture conditions suitable to express the recombinant protein. The resulting expressed protein may then be purified from such culture (i.e., from culture medium or cell extracts) using known purification processes, such as gel filtration and ion exchange chromatography. The purification of the protein may also include an affinity column containing agents which will bind to the protein; one or more column steps over such affinity resins as concanavalin A-agarose, heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; one or more steps involving hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl ether; or immunoaffinity chromatography.

[0114] Alternatively, the polypeptide or protein of the invention may also be expressed in a form which will facilitate purification. For example, it may be expressed as a fusion protein, such as those of maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a His tag. Kits for expression and purification of such fusion proteins are commercially available from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen, respectively. The polypeptide or protein can also be tagged with an epitope and subsequently purified by using a specific antibody directed to such epitope. One such epitope (“FLAG®”) is commercially available from Kodak (New Haven, Conn.).

[0115] Finally, one or more reverse-phase high performance liquid chromatography (RP-HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other aliphatic groups, can be employed to further purify the protein. Some or all of the foregoing purification steps, in various combinations, can also be employed to provide a substantially homogeneous isolated recombinant protein. The protein thus purified is substantially free of other mammalian proteins and is defined in accordance with the present invention as an “isolated protein.”

[0116] The polypeptides of the invention include analogs (variants). Analogs embrace fragments, as well as antagonists which comprise one or more amino acids deleted, inserted, or substituted. Analogs of the invention also embrace fusions of the polypeptide of the invention or modifications of the polypeptide of the invention or analog is fused to another moiety or moieties, e.g., targeting moiety, imaging moiety or another therapeutic agent. Such analogs may exhibit improved properties such as activity and/or stability. Examples of moieties which may be fused to polypeptides of the invention or analogs thereof include, for example, targeting moieties which provide for the delivery of polypeptide to desired cell types. Other moieties which may be fused to the polypeptides of the invention include therapeutic agents which are used for treatment of disorders described herein.

4.5.1 Determining Polypeptide and Polynucleotide Identity and Similarity

[0117] Preferred identity and/or similarity are designed to give the largest match between the sequences tested. Methods to determine identity and similarity are codified in publicly available computer programs including, but are not limited to, the GCG program package, including GAP (Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, University of Wisconsin, Madison, Wis.), BLASTP, BLASTN, BLASTX, and FASTA (Atschul, S. F. et al., J. Molec. Biol. 215:403-410 (1990). The BLAST X program is publicly available from the National Center for Biotechnology Information (NCBI) and other sources (BLAST Manual, Altschul, S., et al. NCB NLM NIH Bethesda, Md. 20894; Altschul, S., et al., J. Mol. Biol. 215:403-410 (1990). The preferred computer program is FASTA version 3, specifically the FASTy program within the FASTA program package. Another preferred algorithm is the well-known Smith Waterman algorithm which can also be used to determine identity.

4.6 Gene Therapy

[0118] Mutations in the polynucleotides of the invention gene may result in loss of normal function of the encoded protein. The invention thus provides gene therapy to restore normal activity of the polypeptides of the invention; or to treat disease states involving polypeptides of the invention. Delivery of a functional genes encoding polypeptides of the invention to appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example, Anderson, Nature, supplement to vol. 392, no. 6679, pp.25-20 (1998). For additional reviews of gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific American: 68-84 (1990); and Miller, Nature, 357: 455460 (1992). Introduction of any one of the nucleotides of the present invention or a gene encoding the polypeptides of the present invention can also be accomplished with extrachromosomal substrates (transient expression) or artificial chromosomes (stable expression). Cells may also be cultured ex vivo in the presence of proteins of the present invention in order to proliferate or to produce a desired effect on or activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. Alternatively, it is contemplated that in other human disease states, preventing the expression of or inhibiting the activity of polypeptides of the invention will be useful in treating the disease states. It is contemplated that antisense therapy or gene therapy could be applied to negatively regulate the expression of polypeptides of the invention.

[0119] Other methods inhibiting expression of a protein include the introduction of antisense molecules to the nucleic acids of the present invention, their complements, or their translated RNA sequences, by methods known in the art, the removal of the nucleic acids of the present invention such as using targeted deletion methods, or the insertion of a negative regulatory element such as a silencer, which is tissue specific. Further, the polypeptides of the present invention can be inhibited by the introduction of antisense molecules that hybridize to nucleic acids that encode for the polypeptides of the present invention and by the removal of a gene that encode for the polypeptides of the present invention.

[0120] The present invention still further provides cells genetically engineered in vivo to express the polynucleotides of the invention, wherein such polynucleotides are in operative association with a regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in the cell. These methods can be used to increase or decrease the expression of the polynucleotides of the present invention.

[0121] Knowledge of DNA sequences provided by the invention allows for modification of cells to permit, increase, or decrease, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous recombination) to provide increased polypeptide expression by replacing, in whole or in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells express the protein at higher levels. The heterologous promoter is inserted in such a manner that it is operatively linked to the desired protein encoding sequences. See, for example, PCT International Publication No. WO 94/12650, PCT International Publication No. WO 92/20808, and PCT International Publication No. WO 91/09955. It is also contemplated that, in addition to heterologous promoter DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be inserted along with the heterologous promoter DNA. If linked to the desired protein coding sequence, amplification of the marker DNA by standard selection methods results in co-amplification of the desired protein coding sequences in the cells.

[0122] In another embodiment of the present invention, cells and tissues may be engineered to express an endogenous gene comprising the polynucleotides of the invention under the control of inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may be replaced by homologous recombination. As described herein, gene targeting can be used to replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene or a novel regulatory sequence synthesized by genetic engineering methods. Such regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations of said sequences. Alternatively, sequences which affect the structure or stability of the RNA or protein produced may be replaced, removed, added, or otherwise modified by targeting. These sequence include polyadenylation signals, mRNA stability elements, splice sites, leader sequences for enhancing or modifying transport or secretion properties of the protein, or other sequences which alter or improve the function or stability of protein or RNA molecules.

[0123] The targeting event may be a simple insertion of the regulatory sequence, placing the gene under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the targeting event may replace an existing element; for example, a tissue-specific enhancer can be replaced by an enhancer that has broader or different cell-type specificity than the naturally occurring elements. Here, the naturally occurring sequences are deleted and new sequences are added. In all cases, the identification of the targeting event may be facilitated by the use of one or more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection of cells in which the exogenous DNA has integrated into the cell genome. The identification of the targeting event may also be facilitated by the use of one or more marker genes exhibiting the property of negative selection, such that the negatively selectable marker is linked to the exogenous DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and such that a correct homologous recombination event with sequences in the host cell genome does not result in the stable integration of the negatively selectable marker. Markers useful for this purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) gene.

[0124] The gene targeting or gene activation techniques which can be used in accordance with this aspect of the invention are more particularly described in U.S. Pat. No. 5,272,071 to Chappel; U.S. Pat. No. 5,578,461 to Sherwin et al.; International Application No. PCT/US92109627 (WO93/09222) by Selden et al.; and International Application No. PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference herein in its entirety.

4.7 Transgenic Animals

[0125] In preferred methods to determine biological functions of the polypeptides of the invention in vivo, one or more genes provided by the invention are either over expressed or inactivated in the germ line of animals using homologous recombination [Capecchi, Science 244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory control of exogenous or endogenous promoter elements, are known as transgenic animals. Animals in which an endogenous gene has been inactivated by homologous recombination are referred to as “knockout” animals. Knockout animals, preferably non-human mammals, can be prepared as described in U.S. Pat. No. 5,557,032, incorporated herein by reference. Transgenic animals are useful to determine the roles polypeptides of the invention play in biological processes, and preferably in disease states. Transgenic animals are useful as model systems to identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human mammals, are produced using methods as described in U.S. Pat. No 5,489,743 and PCT Publication No. WO94/28122, incorporated herein by reference.

[0126] Transgenic animals can be prepared wherein all or part of a polynucleotides of the invention promoter is either activated or inactivated to alter the level of expression of the polypeptides of the invention. Inactivation can be carried out using homologous recombination methods described above. Activation can be achieved by supplementing or even replacing the homologous promoter to provid for increased protein expression. The homologous promoter can be supplemented by insertion of one or more heterologous enhancer elements known to confer promoter activation in a particular tissue.

4.8. Uses and Biological Activity

[0127] The polynucleotides and polypeptides of the present invention are expected to exhibit one or more of the uses or biological activities (including those associated with assays cited herein) identified herein. Uses or activities described for polypeptides or proteins of the present invention may be provided by administration or use of such polypeptides or proteins or of polynucleotides encoding such polypeptides or proteins (such as, for example, in gene therapies or vectors suitable for introduction of DNA). The mechanism underlying the particular condition or pathology will dictate whether the polypeptides of the invention, the polynucleotides of the invention or modulators (activators or inhibitors) thereof would be beneficial to the subject in need of treatment. Thus, “therapeutic compositions of the invention” include compositions comprising isolated polynucleotides (including recombinant DNA molecules, cloned genes and degenerate variants thereof) or polypeptides of the invention (including full length protein, mature protein and truncations or domains thereof), or compounds and other substances that modulate the overall activity of the target gene products, either at the level of target gene/protein expression or target protein activity. Such modulators include polypeptides, analogs, (variants), including fragments and fusion proteins, antibodies and other binding proteins; chemical compounds that directly or indirectly activate or inhibit the polypeptides of the invention (identified, e.g., via drug screening assays as described herein); antisense polynucleotides and polynucleotides suitable for triple helix formation; and in particular antibodies or other binding partners that specifically recognize one or more epitopes of the polypeptides of the invention.

[0128] The polypeptide or protein of the present invention may likewise be involved in cellular activation or in one of the other physiological pathways described herein.

4.8.1. Research Uses and Utilities

[0129] The polynucleotides provided by the present invention can be used by the research community for various purposes. The polynucleotides can be used to express recombinant protein for analysis, characterization or therapeutic use; as markers for tissues in which the corresponding protein is preferentially expressed (either constitutively or at a particular stage of tissue differentiation or development or in disease states); as molecular weight markers on gels; as chromosome markers or tags (when labeled) to identify chromosomes or to map related gene positions; to compare with endogenous DNA sequences in patients to identify potential genetic disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of information to derive PCR primers for genetic fingerprinting; as a probe to “subtract-out” known sequences in the process of discovering other novel polynucleotides; for selecting and making oligomers for attachment to a “gene chip” or other support, including for examination of expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and as an antigen to raise anti-DNA antibodies or elicit another immune response. Where the polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for example, in a receptor-ligand interaction), the polynucleotide can also be used in interaction trap assays (such as, for example, that described in Gyuris et al., Cell 75:791-803 (1993)) to identify polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of the binding interaction.

[0130] The polypeptides or proteins provided by the present invention can similarly be used in assays to determine biological activity, including in a panel of multiple proteins for high-throughput screening; to raise antibodies or to elicit another immune response; as a reagent (including the labeled reagent) in assays designed to quantitatively determine levels of the polypeptide or protein (or its receptor) in biological fluids; as markers for tissues in which the corresponding polypeptide or protein is preferentially expressed (either constitutively or at a particular stage of tissue differentiation or development or in a disease state); and, of course, to isolate correlative receptors or ligands. Where the protein binds or potentially binds to another polypeptide or protein (such as, for example, in a receptor-ligand interaction), the polypeptide or protein can be used to identify the other polypeptide or protein with which binding occurs or to identify inhibitors of the binding interaction. Polypeptides or proteins involved in these binding interactions can also be used to screen for peptide or small molecule inhibitors or agonists of the binding interaction.

[0131] Any or all of these research utilities are capable of being developed into reagent grade or kit format for commercialization as research products.

[0132] Methods for performing the uses listed above are well known to those skilled in the art. References disclosing such methods include without limitation “Molecular Cloning: A Laboratory Manual”, 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch and T. Maniatis eds., 1989, and “Methods in Enzymology: Guide to Molecular Cloning Techniques”, Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987.

4.8.2. Nutritional Uses

[0133] Polynucleotides and polypeptides or proteins of the present invention can also be used as nutritional sources or supplements. Such uses include without limitation use as a protein or amino acid supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In such cases the polypeptide or polynucleotide of the invention can be added to the feed of a particular organism or can be administered as a separate solid or liquid preparation, such as in the form of powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the polypeptide or polynucleotide of the invention can be added to the medium in or on which the microorganism is cultured.

4.8.3. Cytokine and Cell Proliferation/Differentiation Activity

[0134] A polypeptide of the present invention may exhibit activity relating to cytokine, cell proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting) activity or may induce production of other cytokines in certain cell populations. A polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many polypeptide or protein factors discovered to date, including all known cytokines, have exhibited activity in one or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient confirmation of cytokine activity. The activity of therapeutic compositions of the present invention is evidenced by any one of a number of routine factor dependent cell proliferation assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3, MC9/G, M+(preB M+), 2E8, RB5, DA1, 123, T1165, HT2, CTLL2, TF-1, Mo7e, CMK, HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following:

[0135] Assays for T-cell or thymocyte proliferation include without limitation those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. immunol. 145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Bertagnolli, et al., I. Immunol. 149:3778-3783, 1992; Bowman et al., I. Immunol. 152:1756-1761, 1994.

[0136] Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse and human interleukin-γ, Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994.

[0137] Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., Nature 336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse and human interleukin 6—Nordan, R. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Natl. Aced. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin 11—Bennett, F., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; Measurement of mouse and human Interleukin 9—Ciarletta, A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991.

[0138] Assays for T-cell clone responses to antigens (which will identify, among others, proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring proliferation and cytokine production) include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et al., Proc. Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et al., Eur. J. Inmunol. 11:405411, 1981; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988.

4.8.4 Stem Cell Growth Factor Activity

[0139] A polypeptide of the present invention may exhibit stem cell growth factor activity and be involved in the proliferation, differentiation and survival of pluripotent and totipotent stem cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or germ line stem cells. Administration of the polypeptide of the invention to stem cells in vivo or ex vivo is expected to maintain and expand cell populations in a totipotential or pluripotential state which would be useful for re-engineering damaged or diseased tissues, transplantation, manufacture of bio-pharmaceuticals and the development of bio-sensors. The ability to produce large quantities of human cells has important working applications for the production of human proteins which currently must be obtained from non-human sources or donors, implantation of cells to treat diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases; tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung.

[0140] It is contemplated that multiple different exogenous growth factors and/or cytokines may be administered in combination with the polypeptide of the invention to achieve the desired effect, including any of the growth factors listed herein, other stem cell maintenance factors, and specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand (Flt-3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage inflammatory protein 1-alpha (MIP-1-alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic fibroblast growth factor (bFGF).

[0141] Since totipotent stem cells can give rise to virtually any mature cell type, expansion of these cells in culture will facilitate the production of large quantities of mature cells. Techniques for culturing stem cells are known in the art and administration of polypeptides of the invention, optionally with other growth factors and/or cytokines, is expected to enhance the survival and proliferation of the stem cell populations. This can be accomplished by direct administration of the polypeptide of the invention to the culture medium. Alternatively, stroma cells transfected with a polynucleotide that encodes for the polypeptide of the invention can be used as a feeder layer for the stem cell populations in culture or in vivo. Stromal support cells for feeder layers may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or cultured embryonic fibroblasts (see U.S. Pat. No.; 5,690,926).

[0142] Stem cells themselves can be transfected with a polynucleotide of the invention to induce autocrine expression of the polypeptide of the invention. This will allow for generation of undifferentiated totipotential/pluripotential stem cell lines that are useful as is or that can then be differentiated into the desired mature cell types. These stable cell lines can also serve as a source of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for polymerase chain reaction experiments. These studies would allow for the isolation and identification of differentially expressed genes in stem cell populations that regulate stem cell proliferation and/or maintenance.

[0143] Expansion and maintenance of totipotent stem cell populations will be useful in the treatment of many pathological conditions. For example, polypeptides of the present invention may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or genetic disorders. The polypeptide of the invention may be useful for inducing the proliferation of neural cells and for the regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In addition, the expanded stem cell populations can also be genetically altered for gene therapy purposes and to decrease host rejection of replacement tissues after grafting or implantation.

[0144] Expression of the polypeptide of the invention and its effect on stem cells can also be manipulated to achieve controlled differentiation of the stem cells into more differentiated cell types. A broadly applicable method of obtaining pure populations of a specific differentiated cell type from undifferentiated stem cell populations involves the use of a cell-type specific promoter driving a selectable marker. The selectable marker allows only cells of the desired type to survive. For example, stem cells can be induced to differentiate into cardiomyocytes (Wobus et al., Differentiation, 48: 173-182 , (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) or skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al., Academic Press (1997)). Alternatively, directed differentiation of stem cells can be accomplished by culturing the stem cells in the presence of a differentiation factor such as retinoic acid and an antagonist of the polypeptide of the invention which would inhibit the effects of endogenous stem cell factor activity and allow differentiation to proceed.

[0145] In vitro cultures of stem cells can be used to determine if the polypeptide of the invention exhibits stem cell growth factor activity. Stem cells are isolated from any one of various cell sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder layer, as described by Thompson et al. Proc. Natl. Acad. Sci, U.S.A., 92: 7844-7848 (1995), in the presence of the polypeptide of the invention alone or in combination with other growth factors or cytokines. The ability of the polypeptide of the invention to induce stem cells proliferation is determined by colony formation on semi-solid support e.g. as described by Bernstein et al., Blood, 77: 2316-2321 (1991).

4.8.5. Hematopoiesis Regulating Activity

[0146] A polypeptide or protein of the present invention may be involved in regulation of hematopoiesis and, consequently, in the treatment of myeloid or lymphoid cell deficiencies. Even marginal biological activity in support of colony forming cells or of factor-dependent cell lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of erythroid progenitor cells alone or in combination with other cytokines, thereby indicating utility, for example, in treating various anemias or for use in conjunction with irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and monocytes/macrophages (i.e., traditional CSF activity) useful, for example, in conjunction with chemotherapy to prevent or treat consequent myelo-suppression; in supporting the growth and proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or treatment of various platelet disorders such as thrombocytopenia, and generally for use in place of or complimentary to platelet transfusions; and/or in supporting the growth and proliferation of hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such as those usually treated with transplantation, including, without limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment post irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow transplantation or with peripheral progenitor cell transplantation (homologous or heterologous)) as normal cells or genetically manipulated for gene therapy.

[0147] Therapeutic compositions of the invention can be used in the following:

[0148] Suitable assays for proliferation and differentiation of various hematopoietic lines are cited above.

[0149] Assays for embryonic stem cell differentiation (which will identify, among others, proteins that influence embryonic differentiation hematopoiesis) include, without limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., Molecular and Cellular Biology 13:473486, 1993; McClanahan et al., Blood 81:2903-2915, 1993.

[0150] Assays for stem cell survival and differentiation (which will identify, among others, proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-5911, 1992; Primitive hematopoietic colony forming cells with high proliferative potential, McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et al., Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay, Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 1-21, Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long term culture initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994.

4.8.6 Immune Stimulating or Suppressing Activity

[0151] Compositions of the present invention may also exhibit immune stimulating or immune suppressing activity, including without limitation the activities for which assays are described herein. A polynucleotide of the invention can encode a polypeptide involved in such activities. A protein or antibody, other binding partner, or other modulator of the invention may be useful in the treatment of various immune deficiencies and disorders (including severe combined immunodeficiency (SCID)), e.g., in regulating (up or down) growth and proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells and other cell populations. These immune deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders. More specifically, infectious diseases caused by viral, bacterial, fungal or other infection may be treatable using a protein, antibody, binding partner, or other modulator of the invention, including infections by HIV, hepatitis viruses, herpesviruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such as candidiasis, as well as other conditions where a boost to the immune system generally may be desirable, e.g., in the treatment of cancer.

[0152] Autoimmune disorders which may involve a protein of the present invention include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, graft-versus-host disease and autoimmune inflammatory eye disease. Such a polypeptide or protein of the present invention may also to be involved in allergic reactions and conditions, such as asthma (particularly allergic asthma), bronchitis or other respiratory problems.

[0153] Using the polypeptides or proteins, antibody, binding partners, or other modulators of the invention it may also be possible to modulate immune responses, in a number of ways. The immune response may be enhanced or suppressed. Down regulation may be in the form of inhibiting or blocking an immune response already in progress or may involve preventing the induction of an immune response. The functions of activated T cells may be inhibited by suppressing T cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression of T cell responses is generally an active, non-antigen-specific, process which requires continuous exposure of the T cells to the suppressive agent. Tolerance, which involves inducing non-responsiveness or anergy in T cells, is distinguishable from immunosuppression in that it is generally antigen-specific and persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence of the tolerizing agent.

[0154] Down regulating or preventing the immune response, e.g., preventing high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and organ transplantation and in graft-versus-host disease (GVHD). For example, blockage of T cell function should result in reduced tissue destruction in tissue transplantation. Typically, in tissue transplants, rejection of the transplant is initiated through its recognition as foreign by T cells, followed by an immune reaction that destroys the transplant. The administration of a molecule which inhibits or blocks the immune response (e.g. a receptor fragment, binding partner, or other modulator such as antisense polynucleotides) may act as an immunosuppressant.

[0155] The efficacy of particular immune response modulators in preventing organ transplant rejection or GVHD can be assessed using animal models that are predictive of efficacy in humans. Examples of appropriate systems which can be used include allogeneic cardiac grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci USA, 89:11102-11105 (1992). In addition, murine models of GVHD (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to determine the effect of blocking B lymphocyte antigen function in vivo on the development of that disease.

[0156] Blocking the inflammatory response may also be therapeutically useful for treating autoimmune diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that are reactive against self tissue and which promote the production of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the activation of autoreactive T cells may reduce or eliminate disease symptoms. Administration of reagents which block costimulation of T cells can be used to inhibit T cell activation and prevent production of autoantibodies or T cell-derived cytokines which may be involved in the disease process. Additionally, blocking reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to long-term relief from the disease. The efficacy of blocking reagents in preventing or alleviating autoimmune disorders can be determined using a number of well-characterized animal models of human autoimmune diseases. Examples include murine experimental autoimmune encephalitis, systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental myasthenia gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 840-856).

[0157] Upregulation of immune responses, may also be useful in therapy. Upregulation of immune responses may be in the form of enhancing an existing immune response or eliciting an initial immune response. For example, enhancing an immune response may be useful in cases of viral infection such as influenza, the common cold, and encephalitis.

[0158] Alternatively, anti-viral immune responses may be enhanced in an infected patient by removing T cells from the patient, costimulating the T cells in vitro and reintroducing the in vitro activated T cells into the patient.

[0159] The activity of therapeutic compositions of the invention may, among other means, be measured by the following methods:

[0160] Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. Immunol. 135:1564-1572, 1985; Takai et al., I. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988; Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Bowmanet al., J. Virology 61:1992-1998; Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994.

[0161] Assays for T-cell-dependent immunoglobulin responses and isotype switching (which will identify, among others, proteins that modulate T-cell dependent antibody responses and that affect Th1/Th2 profiles) include, without limitation, those described in: Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994.

[0162] Mixed lymphocyte reaction (MLR) assays (which will identify, among others, proteins that generate predominantly Th1 and CTL responses) include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992.

[0163] Dendritic cell-dependent assays (which will identify, among others, proteins expressed by dendritic cells that activate naive T-cells) include, without limitation, those described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of Experimental Medicine 173:549-559, 1991; Macatonia et al., Journal of Immunology 154:5071-5079, 1995; Porgador et al., Journal of Experimental Medicine 182:255-260, 1995; Nair et al., Journal of Virology 67:4062-4069, 1993; Huang et al., Science 264:961-965, 1994; Macatonia et al., Journal of Experimental Medicine 169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation 94:797-807, 1994; and Inaba et al., Journal of Experimental Medicine 172:631-640, 1990.

[0164] Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry 13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, Journal of Immunology 145:40374045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., International Journal of Oncology 1:639-648, 1992.

[0165] Assays for proteins that influence early steps of T-cell commitment and development include, without limitation, those described in: Antica et al., Blood 84:111-117, 1994; Fine et al., Cellular Immunology 155:111-122, 1994; Galy et al., Blood 85:2770-2778, 1995; Toki et al., Proc. Nat. Acad Sci. USA 88:7548-7551, 1991.

4.8.7 Tissue Growth Activity

[0166] A polypeptide or protein of the present invention also may be involved in bone, cartilage, tendon, ligament and/or nerve tissue growth or regeneration, as well as in wound healing and tissue repair and replacement, and in healing of burns, incisions and ulcers.

[0167] For example, induction of cartilage and/or bone growth in circumstances where bone is not normally formed, has application in the healing of bone fractures and cartilage damage or defects in humans and other animals. Compositions of a protein, antibody, binding partner, or other modulator of the invention may have prophylactic use in closed as well as open fracture reduction and also in the improved fixation of artificial joints. De novo bone formation induced by an osteogenic agent contributes to the repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and also is useful in cosmetic plastic surgery.

[0168] A polypeptide or protein of this invention may also be involved in attracting bone-forming cells, stimulating growth of bone-forming cells, or inducing differentiation of progenitors of bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.) mediated by inflammatory processes may also be possible using the composition of the invention.

[0169] Another category of tissue regeneration activity that may involve the protein of the present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue or other tissue formation in circumstances where such tissue is not normally formed, has application in the healing of tendon or ligament tears, deformities and other tendon or ligament defects in humans and other animals. Such a preparation employing a tendon/ligament-like tissue inducing protein may have prophylactic use in preventing damage to tendon or ligament tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and in repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue formation induced by a composition of the present invention contributes to the repair of congenital, trauma induced, or other tendon or ligament defects of other origin, and is also useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. The compositions of the present invention may provide environment to attract tendon- or ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or ligament-forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue repair. The compositions of the invention may also be useful in the treatment of tendinitis, carpal tunnel syndrome and other tendon or ligament defects. The compositions may also include an appropriate matrix and/or sequestering agent as a carrier as is well known in the art.

[0170] The compositions of the present invention may also be useful for proliferation of neural cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a composition may be used in the treatment of diseases of the peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager syndrome. Further conditions which may be treated in accordance with the present invention include mechanical and traumatic disorders, such as spinal cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting from chemotherapy or other medical therapies may also be treatable using a composition of the invention.

[0171] Compositions of the invention may also be useful to promote better or faster closure of non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular insufficiency, surgical and traumatic wounds, and the like.

[0172] Compositions of the present invention may also be involved in the generation or regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular endothelium) tissue, or for promoting the growth of cells comprising such tissues. Inhibition or modulation of fibrotic scarring may allow normal tissue to regenerate.

[0173] A composition of the present invention may also be useful for gut protection or regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and conditions resulting from systemic cytokine damage.

[0174] A composition of the present invention may also be useful for promoting or inhibiting differentiation of tissues described above from precursor tissues or cells; or for inhibiting the growth of tissues described above.

[0175] Therapeutic compositions of the invention can be used in the following:

[0176] Assays for tissue generation activity include, without limitation, those described in: International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent Publication No. WO95105846 (nerve, neuronal); International Patent Publication No. WO91/07491 (skin, endothelium).

[0177] Assays for wound healing activity include, without limitation, those described in: Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), Year Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol 71:382-84 (1978).

4.8.8 Activin/Inhibin Activity

[0178] A polypeptide or protein of the present invention may also exhibit activin- or inhibin-related activities. A polynucleotide of the invention may encode a polypeptide exhibiting such characteristics. Inhibins are characterized by their ability to inhibit the release of follicle stimulating hormone (FSH), while activins and are characterized by their ability to stimulate the release of follicle stimulating hormone (FSH). Thus, a polypeptide or protein of the present invention, alone or in heterodimers with a member of the inhibin family, may be useful as a contraceptive based on the ability of inhibins to decrease fertility in female mammals and decrease spermatogenesis in male mammals. Administration of sufficient amounts of other inhibins can induce infertility in these mammals. Alternatively, the polypeptide or protein of the invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin group, may be useful as a fertility inducing therapeutic, based upon the ability of activin molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A polypeptide or protein of the invention may also be useful for advancement of the onset of fertility in sexually immature mammals, so as to increase the lifetime reproductive performance of domestic animals such as, but not limited to, cows, sheep and pigs.

[0179] The activity of a polypeptide or protein of the invention may, among other means, be measured by the following methods.

[0180] Assays for activin/inhibin activity include, without limitation, those described in: Vale et al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et al., Nature 321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. Natl. Acad. Sci. USA 83:3091-3095, 1986.

4.8.9 Chemotactic/Chemokinetic Activity

[0181] A polypeptide or protein of the present invention may be involved in chemotactic or chemokinetic activity for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic receptor activation can be used to mobilize or attract a desired cell population to a desired site of action. Chemotactic or chemokinetic compositions (e.g. proteins, antibodies, binding partners, or modulators of the invention) provide particular advantages in treatment of wounds and other trauma to tissues, as well as in treatment of localized infections. For example, attraction of lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved immune responses against the tumor or infecting agent.

[0182] A protein or peptide has chemotactic activity for a particular cell population if it can stimulate, directly or indirectly, the directed orientation or movement of such cell population. Preferably, the protein or peptide has the ability to directly stimulate directed movement of cells. Whether a particular protein has chemotactic activity for a population of cells can be readily determined by employing such protein or peptide in any known assay for cell chemotaxis.

[0183] Therapeutic compositions of the invention can be used in the following:

[0184] Assays for chemotactic activity (which will identify proteins that induce or prevent chemotaxis) consist of assays that measure the ability of a protein to induce the migration of cells across a membrane as well as the ability of a protein to induce the adhesion of one cell population to another cell population. Suitable assays for movement and adhesion include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines 6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. APMIS 103:140-146, 1995; Muller et al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 1994; Johnston et al. J. of Immunol. 153:1762-1768, 1994.

4.8.10 Hemostatic and Thrombolytic Activity

[0185] A polypeptide or protein of the invention may also be involved in hemostatis or thrombolysis or thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Compositions may be useful in treatment of various coagulation disorders (including hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events in treating wounds resulting from trauma, surgery or other causes. A composition of the invention may also be useful for dissolving or inhibiting formation of thromboses and for treatment and prevention of conditions resulting therefrom (such as, for example, infarction of cardiac and central nervous system vessels (e.g., stroke).

[0186] Therapeutic compositions of the invention can be used in the following:

[0187] Assay for hemostatic and thrombolytic activity include, without limitation, those described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 35:467-474, 1988.

4.8.11 Cancer Diagnosis and Therapy

[0188] Polypeptides of the invention may be involved in cancer cell generation, proliferation or metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For example, the presence or increased expression of a polynucleotide/polypeptide of the invention may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer condition. Identification of single nucleotide polymorphisms associated with cancer or a predisposition to cancer may also be useful for diagnosis or prognosis.

[0189] Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor growth) and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. Therapeutic compositions of the invention may be effective in adult and pediatric oncology including in solid phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer, larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small cell cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic cancers including bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian carcinoma, uterine (including endometrial) cancers, and solid tumor in the ovarian follicle, kidney cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors, neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central nervous system, bone cancers including osteomas, skin cancers including malignant melanoma, tumor progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, hemangiopericytoma and Karposi's sarcoma.

[0190] Polypeptides, polynucleotides, or modulators of polypeptides of the invention (including inhibitors and stimulators of the biological activity of the polypeptide of the invention) may be administered to treat cancer. Therapeutic compositions can be administered in therapeutically effective dosages alone or in combination with adjuvant cancer therapy such as surgery, chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial effect, e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise improving overall clinical condition, without necessarily eradicating the cancer.

[0191] The composition can also be administered in therapeutically effective amounts as a portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or modulator of the invention with one or more anti-cancer drugs in addition to a pharmaceutically acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer treatment is routine. Anti-cancer drugs that are well known in the art and can be used as a treatment in combination with the polypeptide or modulator of the invention include: Actinomycin D, Aminoglutethimide, Asparaginase, Bleomycin, Busulfan, Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis-DDP), Cyclophosphamide, Cytarabine HCl (Cytosine arabinoside), Dacarbazine, Dactinomycin, Daunorubicin HCl, Doxorubicin HCl, Estramustine phosphate sodium, Etoposide (V16-213), Floxuridine, 5-Fluorouracil (5-Fu), Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, Interferon Alpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), Lomustine, Mechlorethamine HCl (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, Methotrexate (MTX), Mitomycin, Mitoxantrone HCl, Octreotide, Plicamycin, Procarbazine HCl, Streptozocin, Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, Semustine, Teniposide, and Vindesine sulfate.

[0192] In addition, therapeutic compositions of the invention may be used for prophylactic treatment of cancer. There are hereditary conditions and/or environmental situations (e.g. exposure to carcinogens) known in the art that predispose an individual to developing cancers. Under these circumstances, it may be beneficial to treat these individuals with therapeutically effective doses of the polypeptide of the invention to reduce the risk of developing cancers.

[0193] In vitro models can be used to determine the effective doses of the polypeptide of the invention as a potential cancer treatment. These in vitro models include proliferation assays of cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) Culture of Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, N.Y. Ch 18 and Ch 21), tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst., 52: 921-30 (1974), mobility and invasive potential of tumor cells in Boyden Chamber assays as described in Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction of vascularization of the chick chorioallantoic membrane or induction of vascular endothelial cell migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1189-97 (1999) and Li et al., Clin. Exp. Metastasis, 17:423-9 (1999) respectively. Suitable tumor cells lines are available, e.g. from American Type Tissue Culture Collection catalogs.

4.8.12 Receptor/Ligand Activity

[0194] A polypeptide or protein of the present invention may also demonstrate activity as receptor, receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the invention can encode a polypeptide exhibiting such characteristics. Examples of such receptors and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions and their ligands (including without limitation, cellular adhesion molecules (such as selecting, integrins and their ligands) and receptor/ligand pairs involved in antigen presentation, antigen recognition and development of cellular and humoral immune responses. Receptors and ligands are also useful for screening of potential peptide or small molecule inhibitors of the relevant receptor/ligand interaction. A polypeptide or protein of the present invention (including, without limitation, fragments of receptors and ligands) may themselves be useful as inhibitors of receptor/ligand interactions.

[0195] The activity of a polypeptide or protein of the invention may, among other means, be measured by the following methods:

[0196] Suitable assays for receptor-ligand activity include without limitation those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static conditions 7.28.1-7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995.

[0197] By way of example, the polypeptides of the invention may be used as a receptor for a ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be identified through binding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel overlay assays, or other methods known in the art.

[0198] Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or a partial antagonist require the use of other proteins as competing ligands. The polypeptides of the present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, colorimetric molecules or a toxin molecules by conventional methods. (“Guide to Protein Purification” Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 (1990) Academic Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to, tritium and carbon-14. Examples of colorimetric molecules include, but are not limited to, fluorescent molecules such as fluorescamine, or rhodamine or other calorimetric molecules. Examples of toxins include, but are not limited, to ricin.

4.8.13 Drug Screening

[0199] This invention is particularly useful for screening chemical compounds by using the novel polypeptides or binding fragments thereof in any of a variety of drug screening techniques. The polypeptides or fragments employed in such a test may either be free in solution, affixed to a solid support, borne on a cell surface or located intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant nucleic acids expressing the polypeptide or fragment. Drugs are screened against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be used for standard binding assays. One may measure, for example, the formation of complexes between polypeptides of the invention or fragments and the agent being tested or examine the diminution in complex formation between the novel polypeptides and an appropriate cell line, which are well known in the art.

[0200] Sources for test compounds that may be screened for ability to bind to or modulate (i.e., increase or decrease) the activity of polypeptides of the invention include (1) inorganic and organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries comprised of either random or mimetic peptides, oligonucleotides or organic molecules.

[0201] Chemical libraries may be readily synthesized or purchased from a number of commercial sources, and may include structural analogs of known compounds or compounds that are identified as “hits” or “leads” via natural product screening.

[0202] The sources of natural product libraries are microorganisms (including bacteria and fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for screening may be created by: (1) fermentation and extraction of broths from soil, plant or marine microorganisms or (2) extraction of the organisms themselves. Natural product libraries include polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a review, see Science 282:63-68 (1998).

[0203] Combinatorial libraries are composed of large numbers of peptides, oligonucleotides or organic compounds and can be readily prepared by traditional automated synthesis methods, PCR, cloning or proprietary synthetic methods. Of particular interest are peptide and oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries. For a review of combinatorial chemistry and libraries created therefrom, see Myers, Curr. Opin. Biotechnol. 8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see Al-Obeidi et al., Mol. Biotechnol, 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol, 1(1):114-19 (1997); Dorner et al., Bioorg Med Chem, 4(5):709-15 (1996) (alkylated dipeptides).

[0204] Identification of modulators through use of the various libraries described herein permits modification of the candidate “hit” (or “lead”) to optimize the capacity of the “hit” to bind a polypeptide of the invention. The molecules identified in the binding assay are then tested for antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the art. In brief, the molecules are titrated into a plurality of cell cultures or animals and then tested for either cell/animal death or prolonged survival of the animal/cells.

[0205] The binding molecules thus identified may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding molecule complex is then targeted to a tumor or other cell by the specificity of the binding molecule for a polypeptide of the invention. Alternatively, the binding molecules may be complexed with imaging agents for targeting and imaging purposes.

4.8.14 Assay for Receptor Activity

[0206] The invention also provides methods to detect specific binding of a polypeptide e.g. a ligand or a receptor. The art provides numerous assays particularly useful for identifying previously unknown binding partners for receptor polypeptides of the invention. For example, expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used to identify polynucleotides encoding binding partners. As another example, affinity chromatography with the appropriate immobilized polypeptide of the invention can be used to isolate polypeptides that recognize and bind polypeptides of the invention (represented by the nucleotide or amino acid sequences set forth in the Sequence Listing). There are a number of different libraries used for the identification of compounds, and in particular small molecule, that modulate (i.e., increase or decrease) biological activity of a polypeptide of the invention. Ligands for receptor polypeptides of the invention can also be identified by adding exogenous ligands, or cocktails of ligands to two cells populations that are genetically identical except for the expression of the receptor of the invention: one cell population expresses the receptor of the invention whereas the other does not. The response of the two cell populations to the addition of ligands(s) are then compared. Alternatively, an expression library can be co-expressed with the polypeptide of the invention in cells and assayed for an autocrine response to identify potential ligand(s). As still another example, BIAcore assays, gel overlay assays, or other methods known in the art can be used to identify binding partner polypeptides, including, (1) organic and inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries comprised of random peptides, oligonucleotides or organic molecules.

[0207] The role of downstream intracellular signaling molecules in the signaling cascade of the polypeptide of the invention can be determined. For example, a chimeric protein in which the cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a protein, whose ligand has been identified, is produced in a host cell. The cell is then incubated with the ligand specific for the extracellular portion of the chimeric protein, thereby activating the chimeric receptor. Known downstream proteins involved in intracellular signaling can then be assayed for expected modifications i.e. phosphorylation. Other methods known to those in the art can also be used to identify signaling molecules involved in receptor activity.

4.8.15 Anti-Inflammatory Activity

[0208] Compositions of the present invention may also exhibit anti-inflammatory activity. The anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production of other factors which more directly inhibit or promote an inflammatory response. Compositions with such activities can be used to treat inflammatory conditions including chronic or acute conditions), including without limitation intimation associated with infection (such as septic shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from over production of cytokines such as TNF or IL-1. Compositions of the invention may also be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. Compositions of this invention may be utilized to prevent or treat condition such as, but not limited to, utilized, for example, as part of methods for the prevention and/or treatment of disorders involving sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, graft versus host disease, inflammatory bowel disease, inflamation associated with pulmonary disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to intrauterine infections.

4.8.16 Leukemias

[0209] Leukemias and related disorders may be treated or prevented by administration of a therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the invention. Such leukemias and related disorders include but are not limited to acute leukemia, acute lymphocytic leukemia, acute myelocytic leukemia, myeloblastic, promyelocytic, myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic myelocytic (granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see Fishman et al., 1985, Medicine, 2d Ed., J. B. Lippincott Co., Philadelphia).

4.8.17 Nervous System Disorders

[0210] Nervous system disorders, involving cell types which can be tested for efficacy of intervention with compounds that modulate the activity of the polynucleotides and/or polypeptides of the invention, and which can be treated upon thus observing an indication of therapeutic utility, include but are not limited to nervous system injuries, and diseases or disorders which result in either a disconnection of axons, a diminution or degeneration of neurons, or demyelination. Nervous system lesions which may be treated in a patient (including human and non-human mammalian patients) according to the invention include but are not limited to the following lesions of either the central (including spinal cord, brain) or peripheral nervous systems:

[0211] (i) traumatic lesions, including lesions caused by physical injury or associated with surgery, for example, lesions which sever a portion of the nervous system, or compression injuries;

[0212] (ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord infarction or ischemia;

[0213] (iii) infectious lesions, in which a portion of the nervous system is destroyed or injured as a result of infection, for example, by an abscess or associated with infection by human immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, tuberculosis, syphilis;

[0214] (iv) degenerative lesions, in which a portion of the nervous system is destroyed or injured as a result of a degenerative process including but not limited to degeneration associated with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral sclerosis;

[0215] (v) lesions associated with nutritional diseases or disorders, in which a portion of the nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism including but not limited to, vitamin B12 deficiency, folic acid deficiency, Wernicke disease, tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary degeneration of the corpus callosum), and alcoholic cerebellar degeneration;

[0216] (vi) neurological lesions associated with systemic diseases including but not limited to diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, carcinoma, or sarcoidosis;

[0217] (vii) lesions caused by toxic substances including alcohol, lead, or particular neurotoxins; and

[0218] (viii) demyelinated lesions in which a portion of the nervous system is destroyed or injured by a demyelinating disease including but not limited to multiple sclerosis, human immunodeficiency virus-associated myelopathy, transverse myelopathy or various etiologies, progressive multifocal leukoencephalopathy, and central pontine myelinolysis.

[0219] Therapeutics which are useful according to the invention for treatment of a nervous system disorder may be selected by testing for biological activity in promoting the survival or differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit any of the following effects may be useful according to the invention:

[0220] (i) increased survival time of neurons in culture;

[0221] (ii) increased sprouting of neurons in culture or in vivo;

[0222] (iii) increased production of a neuron-associated molecule in culture or in vivo, e.g., choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or

[0223] (iv) decreased symptoms of neuron dysfunction in vivo.

[0224] Such effects may be measured by any method known in the art. In preferred, non-limiting embodiments, increased survival of neurons may be measured by the method set forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or Brown et al. (1981, Ann. Rev. Neurosci. 4:1742); increased production of neuron-associated molecules may be measured by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc., depending on the molecule to be measured; and motor neuron dysfunction may be measured by assessing the physical manifestation of motor neuron disorder, e.g., weakness, motor neuron conduction velocity, or functional disability.

[0225] In specific embodiments, motor neuron disorders that may be treated according to the invention include but are not limited to disorders such as infarction, infection, exposure to toxin, trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as well as other components of the nervous system, as well as disorders that selectively affect neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome), poliomyelitis and the post polio syndrome, and Hereditary Motorsensory Neuropathy (Charcot-Marie-Tooth Disease).

4.8.18 Arthritis and Inflammation

[0226] The immunosuppressive effects of the compositions of the invention against rheumatoid arthritis is determined in an experimental animal model system. The experimental model system is adjuvant induced arthritis in rats, and the protocol is described by J. Holoshitz, et at., 1983, Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. Allergy Appl. Immunol., 23:129. Induction of the disease can be caused by a single injection, generally intradermally, of a suspension of killed Mycobacterium tuberculosis in complete Freund's adjuvant (CFA). The route of injection can vary, but rats may be injected at the base of the tail with an adjuvant mixture. The inhibitor is administered in phosphate buffered solution (PBS) at a dose of about 1-5 mg/kg. The control consists of administering PBS only.

[0227] The procedure for testing the effects of the test compound would consist of intradermally injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the inhibitor and subsequent treatment every other day until day 24. At 14, 15, 18, 20, 22, and 24 days after injection of Mycobacterium CFA, an overall arthritis score may be obtained as described by J. Holoskitz above. An analysis of the data would reveal that the test compound would have a dramatic affect on the swelling of the joints as measured by a decrease of the arthritis score.

4.8.19 Other Activities

[0228] A polypeptide or protein of the invention may also exhibit one or more of the following additional activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents, including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape (such as, for example, breast augmentation or diminution, change in bone form or shape); effecting biorhythms or circadian cycles or rhythms; effecting the fertility of male or female subjects; effecting the metabolism, catabolism, anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other nutritional factors or component(s); effecting behavioral characteristics, including, without limitation, appetite, libido, stress, cognition (including cognitive disorders), depression (including depressive disorders) and violent behaviors; providing analgesic effects or other pain reducing effects; promoting differentiation and growth of embryonic stem cells in lineages other than hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies of the enzyme and treating deficiency-related diseases; treatment of hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for example, the ability to bind antigens or complement); and the ability to act as an antigen in a vaccine composition to raise an immune response against such protein or another material or entity which is cross-reactive with such protein.

4.8.20 Identification of Polymorphisms

[0229] The demonstration of polymorphisms makes possible the identification of such polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis and treatment. Such polymorphisms may be associated with, e.g., differential predisposition or susceptibility to various disease states (such as disorders involving inflammation or immune response) or a differential response to drug administration, and this genetic information can be used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a polymorphism associated with a predisposition to inflammation or autoimmune disease makes possible the diagnosis of this condition in humans by identifying the presence of the polymorphism.

[0230] Polymorphisms can be identified in a variety of ways known in the art which all generally involve obtaining a sample from a patient, analyzing DNA from the sample, optionally involving isolation or amplification of the DNA, and identifying the presence of the polymorphism in the DNA. For example, PCR may be used to amplify an appropriate fragment of genomic DNA which may then be sequenced. Alternatively, the DNA may be subjected to allele-specific oligonucleotide hybridization (in which appropriate oligonucleotides are hybridized to the DNA under conditions permitting detection of a single base mismatch) or to a single nucleotide extension assay (in which an oligonucleotide that hybridizes immediately adjacent to the position of the polymorphism is extended with one or more labeled nucleotides). In addition, traditional restriction fragment length polymorphism analysis (using restriction enzymes that provide differential digestion of the genomic DNA depending on the presence or absence of the polymorphism) may be performed. Arrays with nucleotide sequences of the present invention can be used to detect polymorphisms. The array can comprise modified nucleotide sequences of the present invention in order to detect the nucleotide sequences of the present invention. In the alternative, any one of the nucleotide sequences of the present invention can be placed on the array to detect changes from those sequences.

[0231] Alternatively a polymorphism resulting in a change in the amino acid sequence could also be detected by detecting a corresponding change in amino acid sequence of the protein, e.g., by an antibody specific to the variant sequence.

4.9 Therapeutic Methods

[0232] The compositions (including polypeptide fragments, analogs, variants and antibodies or other binding partners or modulators including antisense polynucleotides) of the invention have numerous applications in a variety of therapeutic methods. Examples of therapeutic applications include, but are not limited to, those exemplified herein.

4.9.1 Examples

[0233] Another embodiment of the invention is the administration of an effective amount of the polypeptide or other composition of the invention to individuals affected by a disease or disorder which can be modulated by regulating the IgSF member of the invention. While the mode of administration is not particularly important, parenteral administration is preferred. An exemplary mode of administration is to deliver an intravenous bolus. The dosage of the polypeptide or composition of the invention will normally be determined by the prescribing physician. It is to be expected that the dosage will vary according to the age, weight, condition and response of the individual patient. Typically, the amount of protein or other active ingredient administered per dose will be in the range of about 0.1 to 25 mg/kg of body weight, with the preferred dose being about 0.1 to 10 mg/kg of patient body weight. For parenteral administration, the polypeptides or other active ingredient of the invention will be formulated in an injectable form that includes a pharmaceutically acceptable parenteral vehicle. Such vehicles are well known in the art and examples include water, saline, Ringer's solution, dextrose solution, and solutions consisting of small amounts of the human serum albumin. The vehicle may contain minor amounts of additives that maintain the isotonicity and stability of the polypeptide or other active ingredient. The preparation of such solutions is within the skill of the art. Typically, the cytokine inhibitor will be formulated in such vehicles at a concentration of about 1-8 mg/ml to about 10 mg/ml.

4.10 Pharmaceutical Formulations and Routes of Administration

[0234] A polypeptide or other composition of the present invention (from whatever source derived, including without limitation from recombinant and non-recombinant sources and including antibodies and other binding partners of the polypeptides of the invention) may be administered to a patient in need, by itself, or in pharmaceutical compositions where it is mixed with suitable carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term “pharmaceutically acceptable” means a non-toxic material that does not interfere with the effectiveness of the biological activity of the active ingredient(s). The characteristics of the carrier will depend on the route of administration. The pharmaceutical composition of the invention may also contain cytokines, lymphokines, or other hematopoietic factors such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IFN, TNF0, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell factor, and erythropoietin. In further compositions, polypeptides or proteins of the invention may be combined with other agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in questions. These agents include various growth factors such as epidermal growth factor (EGF), platelet-derived growth factor (PDGF), transforming growth factors (TGF-α and TGF-β), insulin-like growth factor (IGF), as well as cytokines described herein.

[0235] The pharmaceutical composition may further contain other agents which either enhance the activity of the protein or other active ingredient or compliment its activity or use in treatment. Such additional factors and/or agents may be included in the pharmaceutical composition to produce a synergistic effect with protein or other active ingredient of the invention, or to minimize side effects. Conversely, protein or other active ingredient of the present invention may be included in formulations of the particular cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory agent to minimize side effects of the cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory agent. A polypeptide or protein of the present invention may be active in multimers (e.g., heterodimers or homodimers) or complexes with itself or other proteins. As a result, pharmaceutical compositions of the invention may comprise a polypeptide or protein of the invention in such multimeric or complexed form.

[0236] As an alternative to being included in a pharmaceutical composition of the invention including a first protein, a second protein or a therapeutic agent may be concurrently administered with the first protein (e.g., at the same time, or at differing times provided that therapeutic concentrations of the combination of agents is achieved at the treatment site). Techniques for formulation and administration of the compounds of the instant application may be found in “Remington's Pharmaceutical Sciences,” Mack Publishing Co., Easton, Pa., latest edition. A therapeutically effective dose further refers to that amount of the compound sufficient to result in amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of the relevant medical condition, or an increase in rate of treatment, healing, prevention or amelioration of such conditions. When applied to an individual active ingredient, administered alone, a therapeutically effective dose refers to that ingredient alone. When applied to a combination, a therapeutically effective dose refers to combined amounts of the active ingredients that result in the therapeutic effect, whether administered in combination, serially or simultaneously.

[0237] In practicing the method of treatment or use of the present invention, a therapeutically effective amount of protein or other active ingredient of the present invention is administered to a mammal having a condition to be treated. Polypeptide or other active ingredient of the present invention may be administered in accordance with the method of the invention either alone or in combination with other therapies such as treatments employing cytokines, lymphokines or other hematopoietic factors. When co-administered with one or more cytokines, lymphokines or other hematopoietic factors, protein or other active ingredient of the present invention may be administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If administered sequentially, the attending physician will decide on the appropriate sequence of administering protein or other active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic factors.

4.10.1. Routes of Administration

[0238] Suitable routes of administration may, for example, include oral, rectal, transmucosal, or intestinal administration; parenteral delivery, including intramuscular, subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active ingredient of the present invention used in the pharmaceutical composition or to practice the method of the present invention can be carried out in a variety of conventional ways, such as oral ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. Intravenous administration to the patient is preferred.

[0239] Alternately, one may administer the compound in a local rather than systemic manner, for example, via injection of the compound directly into a arthritic joints or in fibrotic tissue, often in a depot or sustained release formulation. In order to prevent the scarring process frequently occurring as complication of glaucoma surgery, the compounds may be administered topically, for example, as eye drops. Furthermore, one may administer the drug in a targeted drug delivery system, for example, in a liposome coated with a specific antibody, targeting, for example, arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the afflicted tissue.

4.10.2. Compositions/Formulations

[0240] Pharmaceutical compositions for use in accordance with the present invention thus may be formulated in a conventional manner using one or more physiologically acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically. These pharmaceutical compositions may be manufactured in a manner that is itself known, e.g., by means of conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or lyophilizing processes. Proper formulation is dependent upon the route of administration chosen. When a therapeutically effective amount of protein or other active ingredient of the present invention is administered orally, protein or other active ingredient of the present invention will be in the form of a tablet, capsule, powder, solution or elixir. When administered in tablet form, the pharmaceutical composition of the invention may additionally contain a solid carrier such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 95% protein or other active ingredient of the present invention, and preferably from about 25 to 90% protein or other active ingredient of the present invention. When administered in liquid form, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the pharmaceutical composition may further contain physiological saline solution, dextrose or other saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. When administered in liquid form, the pharmaceutical composition contains from about 0.5 to 90% by weight of protein or other active ingredient of the present invention, and preferably from about 1 to 50% protein or other active ingredient of the present invention.

[0241] When a therapeutically effective amount of protein or other active ingredient of the present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or other active ingredient of the present invention will be in the form of a pyrogen-free, parenterally acceptable aqueous solution. The preparation of such parenterally acceptable protein or other active ingredient solutions, having due regard to pH, isotonicity, stability, and the like, is within the skill in the art. A preferred pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection should contain, in addition to protein or other active ingredient of the present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or other vehicle as known in the art. The pharmaceutical composition of the present invention may also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of skill in the art. For injection, the agents of the invention may be formulated in aqueous solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's solution, or physiological saline buffer. For transmucosal administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art.

[0242] For oral administration, the compounds can be formulated readily by combining the active compounds with pharmaceutically acceptable carriers well known in the art. Such carriers enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be treated. Pharmaceutical preparations for oral use can be obtained solid excipient, optionally grinding a resulting mixture, and processing the mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as sodium alginate. Dragee cores are provided with suitable coatings. For this purpose, concentrated sugar solutions may be used, which may optionally contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for identification or to characterize different combinations of active compound doses.

[0243] Pharmaceutical preparations which can be used orally include push-fit capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, stabilizers may be added. All formulations for oral administration should be in dosages suitable for such administration. For buccal administration, the compositions may take the form of tablets or lozenges formulated in conventional manner.

[0244] For administration by inhalation, the compounds for use according to the present invention are conveniently delivered in the form of an aerosol spray presentation from pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin for use in an inhaler or insufflator may be formulated containing a powder mix of the compound and a suitable powder base such as lactose or starch. The compounds may be formulated for parenteral administration by injection, e.g., by bolus injection or continuous infusion. Formulations for injection may be presented in unit dosage form, e.g., in ampules or in multi-dose containers, with an added preservative. The compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents.

[0245] Pharmaceutical formulations for parenteral administration include aqueous solutions of the active compounds in water-soluble form. Additionally, suspensions of the active compounds may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Aqueous injection suspensions may contain substances which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. Optionally, the suspension may also contain suitable stabilizers or agents which increase the solubility of the compounds to allow for the preparation of highly concentrated solutions. Alternatively, the active ingredient may be in powder form for constitution with a suitable vehicle, e.g., sterile pyrogen-free water, before use.

[0246] The compounds may also be formulated in rectal compositions such as suppositories or retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other glycerides. In addition to the formulations described previously, the compounds may also be formulated as a depot preparation. Such long acting formulations may be administered by implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt.

[0247] A pharmaceutical carrier for the hydrophobic compounds of the invention is a cosolvent system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, and an aqueous phase. The cosolvent system may be the VPD co-solvent system. VPD is a solution of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v polyethylene glycol 300, made up to volume in absolute ethanol. The VPD co-solvent system (VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water solution. This co-solvent system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic administration. Naturally, the proportions of a co-solvent system may be varied considerably without destroying its solubility and toxicity characteristics. Furthermore, the identity of the co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may be used instead of polysorbate 80; the fraction size of polyethylene glycol may be varied; other biocompatible polymers may replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and other sugars or polysaccharides may substitute for dextrose. Alternatively, other delivery systems for hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. Additionally, the compounds may be delivered using a sustained-release system, such as semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. Various types of sustained-release materials have been established and are well known by those skilled in the art. Sustained-release capsules may, depending on their chemical nature, release the compounds for a few weeks up to over 100 days. Depending on the chemical nature and the biological stability of the therapeutic reagent, additional strategies for protein or other active ingredient stabilization may be employed.

[0248] The pharmaceutical compositions also may comprise suitable solid or gel phase carriers or excipients. Examples of such carriers or excipients include but are not limited to calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and polymers such as polyethylene glycols. Many of the active ingredients of the invention may be provided as salts with pharmaceutically compatible counterions. Such pharmaceutically acceptable base addition salts are those salts which retain the biological effectiveness and properties of the free acids and which are obtained by reaction with inorganic or organic bases such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, dialkylamine, monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and the like.

[0249] The pharmaceutical composition of the invention may be in the form of a complex of the polypeptide(s) or protein(s) or other active ingredient of present invention along with protein or peptide antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) following presentation of the antigen by MHC proteins. MHC and structurally related proteins including those encoded by class I and class II MHC genes on host cells will serve to present the peptide antigen(s) to T lymphocytes. The antigen components could also be supplied as purified MHC-peptide complexes alone or with co-stimulatory molecules that can directly signal T cells. Alternatively antibodies able to bind surface immunoglobulin and other molecules on B cells as well as antibodies able to bind the TCR and other molecules on T cells can be combined with the pharmaceutical composition of the invention. The pharmaceutical composition of the invention may be in the form of a liposome in which polypeptide or protein of the present invention is combined, in addition to other pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides, sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. Preparation of such liposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S. Pat. Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated herein by reference.

[0250] The amount of protein or other active ingredient of the present invention in the pharmaceutical composition of the present invention will depend upon the nature and severity of the condition being treated, and on the nature of prior treatments which the patient has undergone. Ultimately, the attending physician will decide the amount of protein or other active ingredient of the present invention with which to treat each individual patient. Initially, the attending physician will administer low doses of protein or other active ingredient of the present invention and observe the patient's response. Larger doses of protein or other active ingredient of the present invention may be administered until the optimal therapeutic effect is obtained for the patient, and at that point the dosage is not increased further. It is contemplated that the various pharmaceutical compositions used to practice the method of the present invention should contain about 0.01 μg to about 100 mg (preferably about 0.1 μg to about 10 mg, more preferably about 0.1 μg to about 1 mg) of protein or other active ingredient of the present invention per kg body weight. For compositions of the present invention which are useful for bone, cartilage, tendon or ligament regeneration, the therapeutic method includes administering the composition topically, systematically, or locally as an implant or device. When administered, the therapeutic composition for use in this invention is, of course, in a pyrogen-free, physiologically acceptable form. Further, the composition may desirably be encapsulated or injected in a viscous form for delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable for wound healing and tissue repair. Therapeutically useful agents other than a protein or other active ingredient of the invention which may also optionally be included in the composition as described above, may alternatively or additionally, be administered simultaneously or sequentially with the composition in the methods of the invention. Preferably for bone and/or cartilage formation, the composition would include a matrix capable of delivering the protein-containing or other active ingredient-containing composition to the site of bone and/or cartilage damage, providing a structure for the developing bone and cartilage and optimally capable of being resorbed into the body. Such matrices may be formed of materials presently in use for other implanted medical applications.

[0251] The choice of matrix material is based on biocompatibility, biodegradability, mechanical properties, cosmetic appearance and interface properties. The particular application of the compositions will define the appropriate formulation. Potential matrices for the compositions may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate, hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. Other potential materials are biodegradable and biologically well-defined, such as bone or dermal collagen. Further matrices are comprised of pure proteins or extracellular matrix components. Other potential matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and tricalcium phosphate. The bioceramics may be altered in composition, such as in calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and glycolic acid in the form of porous particles having diameters ranging from 150 to 800 microns. In some applications, it will be useful to utilize a sequestering agent, such as carboxymethyl cellulose or autologous blood clot, to prevent the protein compositions from disassociating from the matrix.

[0252] A preferred family of sequestering agents is cellulosic materials such as alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl-methylcellulose, and carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose (CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and poly(vinyl alcohol). The amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-10 wt % based on total formulation weight, which represents the amount necessary to prevent desorption of the protein from the polymer matrix and to provide appropriate handling of the composition, yet not so much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the protein the opportunity to assist the osteogenic activity of the progenitor cells. In further compositions, proteins or other active ingredient of the invention may be combined with other agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in question. These agents include various growth factors such as epidermal growth factor (EGF), platelet derived growth factor (PDGF), transforming growth factors (TGF-α and TGF-β), and insulin-like growth factor (IGF).

[0253] The therapeutic compositions are also presently valuable for veterinary applications. Particularly domestic animals and thoroughbred horses, in addition to humans, are desired patients for such treatment with proteins or other active ingredient of the present invention. The dosage regimen of a protein-containing pharmaceutical composition to be used in tissue regeneration will be determined by the attending physician considering various factors which modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site of damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g., bone), the patient's age, sex, and diet, the severity of any infection, time of administration and other clinical factors. The dosage may vary with the type of matrix used in the reconstitution and with inclusion of other proteins in the pharmaceutical composition. For example, the addition of other known growth factors, such as IGF I (insulin like growth factor I), to the final composition, may also effect the dosage. Progress can be monitored by periodic assessment of tissue/bone growth and/or repair, for example, X-rays, histomorphometric determinations and tetracycline labeling.

4.10.3. Effective Dosage

[0254] Pharmaceutical compositions suitable for use in the present invention include compositions wherein the active ingredients are contained in an effective amount to achieve its intended purpose. More specifically, a therapeutically effective amount means an amount effective to prevent development of or to alleviate the existing symptoms of the subject being treated. Determination of the effective amount is well within the capability of those skilled in the art, especially in light of the detailed disclosure provided herein. For any compound used in the method of the invention, the therapeutically effective dose can be estimated initially from appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a circulating concentration range that can be used to more accurately determine useful doses in humans. For example, a dose can be formulated in animal models to achieve a circulating concentration range that includes the IC₅₀ as determined in cell culture (i.e., the concentration of the test compound which achieves a half-maximal inhibition of the protein's biological activity). Such information can be used to more accurately determine useful doses in humans.

[0255] A therapeutically effective dose refers to that amount of the compound that results in amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic efficacy of such compounds can be determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD₅₀ (the dose lethal to 50% of the population) and the ED₅₀ (the dose therapeutically effective in 50% of the population). The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the ratio between LD₅₀ and ED₅₀. Compounds which exhibit high therapeutic indices are preferred. The data obtained from these cell culture assays and animal studies can be used in formulating a range of dosage for use in human. The dosage of such compounds lies preferably within a range of circulating concentrations that include the ED₅₀ with little or no toxicity. The dosage may vary within this range depending upon the dosage form employed and the route of administration utilized. The exact formulation, route of administration and dosage can be chosen by the individual physician in view of the patient's condition. See, e.g., Fingl et al., 1975, in “The Pharmacological Basis of Therapeutics”, Ch. 1 p. 1. Dosage amount and interval may be adjusted individually to provide plasma levels of the active moiety which are sufficient to maintain the desired effects, or minimal effective concentration (MEC). The MEC will vary for each compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC will depend on individual characteristics and route of administration. However, HPLC assays or bioassays can be used to determine plasma concentrations.

[0256] Dosage intervals can also be determined using MEC value. Compounds should be administered using a regimen which maintains plasma levels above the MEC for 10-90% of the time, preferably between 30-90% and most preferably between 50-90%. In cases of local administration or selective uptake, the effective local concentration of the drug may not be related to plasma concentration.

[0257] An exemplary dosage regimen for polypeptides or other compositions of the invention will be in the range of about 0.01 to 100 mg/kg of body weight daily, with the preferred dose being about 0.1 to 25 mg/kg of patient body weight daily, varying in adults and children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter intervals.

[0258] The amount of composition administered will, of course, be dependent on the subject being treated, on the subject's age and weight, the severity of the affliction, the manner of administration and the judgment of the prescribing physician.

4.10.4. Packaging

[0259] The compositions may, if desired, be presented in a pack or dispenser device which may contain one or more unit dosage forms containing the active ingredient. The pack may, for example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may be accompanied by instructions for administration. Compositions comprising a compound of the invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an appropriate container, and labeled for treatment of an indicated condition.

4.11. Antibodies

[0260] Another aspect of the invention is an antibody that specifically binds the polypeptide of the invention. Such antibodies include monoclonal and polyclonal antibodies, single chain antibodies, chimeric antibodies, bifunctional/bispecific antibodies, humanized antibodies, human antibodies, and complementary determining region (CDR)-grafted antibodies, including compounds which include CDR and/or antigen-binding sequences, which specifically recognize a polypeptide of the invention. Preferred antibodies of the invention are human antibodies which are produced and identified according to methods described in WO93/11236, published Jun. 20, 1993, which is incorporated herein by reference in its entirety. Antibody fragments, including Fab, Fab′, F(ab′)₂, and F_(v), are also provided by the invention. The term “specific for” indicates that the variable regions of the antibodies of the invention recognize and bind polypeptides of the invention exclusively (i.e., able to distinguish the polypeptide of the invention from other similar polypeptides despite sequence identity, homology, or similarity found in the family of polypeptides), but may also interact with other proteins (for example, S. aureus protein A or other antibodies in ELISA techniques) through interactions with sequences outside the variable region of the antibodies, and in particular, in the constant region of the molecule. Screening assays to determine binding specificity of an antibody of the invention are well known and routinely practiced in the art. For a comprehensive discussion of such assays, see Harlow et al. (Eds), Antibodies A Laboratory Manual; Cold Spring Harbor Laboratory; Cold Spring Harbor, N.Y. (1988), Chapter 6. Antibodies that recognize and bind fragments of the polypeptides of the invention are also contemplated, provided that the antibodies are first and foremost specific for, as defined above, full length polypeptides of the invention. As with antibodies that are specific for full length polypeptides of the invention, antibodies of the invention that recognize fragments are those which can distinguish polypeptides from the same family of polypeptides despite inherent sequence identity, homology, or similarity found in the family of proteins. As with antibodies that are specific for full length polypeptides of the invention, antibodies of the invention that recognize polypeptides of the invention from its the family polypeptides despite inherent sequence identity, homology, or similarity found in the family of proteins. Antibodies of the invention can be produced using any method well known and routinely practiced in the art.

[0261] Non-human antibodies may be humanized by any methods known in the art. In one method, the non-human CDRs are inserted into a human antibody or consensus antibody framework sequence. Further changes can then be introduced into the antibody framework to modulate affinity or immunogenicity.

[0262] Antibodies of the invention are useful for, for example, therapeutic purposes (by modulating activity of a polypeptide of the invention), diagnostic purposes to detect or quantitate a polypeptide of the invention, as well as purification of a polypeptide of the invention. Kits comprising an antibody of the invention for any of the purposes described herein are also comprehended. In general, a kit of the invention also includes a control antigen for which the antibody is immunospecific. The invention further provides a hybridoma that produces an antibody according to the invention. Antibodies of the invention are useful for detection and/or purification of the polypeptides of the invention.

[0263] Polypeptides or proteins of the invention may also be used to immunize animals to obtain polyclonal and monoclonal antibodies which specifically react with the protein. Such antibodies may be obtained using either the entire protein or fragments thereof as an immunogen. The peptide immunogens additionally may contain a cysteine residue at the carboxyl terminus, and are conjugated to a hapten such as keyhole limpet hemocyanin (KLH). Methods for synthesizing such peptides are known in the art, for example, as in R. P. Merrifield, J. Amer. Chem. Soc. 85, 2149-2154 (1963); J. L. Krstenansky, et al., FEBS Lett. 211, 10 (1987).

[0264] Monoclonal antibodies binding to the protein or polypeptide of the invention may be useful diagnostic agents for the immunodetection of the protein or polypeptide. Neutralizing monoclonal antibodies binding to the polypeptide may also be useful therapeutics for both conditions associated with the polypeptide and also in the treatment of some forms of cancer where abnormal expression of the polypeptide is involved. In the case of cancerous cells or leukemic cells, neutralizing monoclonal antibodies against the polypeptide may be useful in detecting and preventing the metastatic spread of the cancerous cells, which may be mediated by the polypeptide. In general, techniques for preparing polyclonal and monoclonal antibodies as well as hybridomas capable of producing the desired antibody are well known in the art (Campbell, A. M., Monoclonal Antibodies Technology: Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1984); St. Groth et al., J. Immunol. 35:1-21 (1990); Kohler and Milstein, Nature 256:495-497 (1975)), the trioma technique, the human B-cell hybridoma technique (Kozbor et al., Immunology Today 4:72 (1983); Cole et al., in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. (1985), pp. 77-96).

[0265] Any animal (mouse, rabbit, etc.) which is known to produce antibodies can be immunized with a peptide or polypeptide of the invention. Methods for immunization are well known in the art. Such methods include subcutaneous or intraperitoneal injection of the polypeptide. One skilled in the art will recognize that the amount of the protein encoded by the ORF of the present invention used for immunization will vary based on the animal which is immunized, the antigenicity of the peptide and the site of injection. The protein that is used as an immunogen may be modified or administered in an adjuvant in order to increase the protein's antigenicity. Methods of increasing the antigenicity of a protein are well known in the art and include, but are not limited to, coupling the antigen with a heterologous protein (such as globulin or β-galactosidase) or through the inclusion of an adjuvant during immunization.

[0266] For monoclonal antibodies, spleen cells from the immunized animals are removed, fused with myeloma cells, such as SP2/0-Ag14 myeloma cells, and allowed to become monoclonal antibody producing hybridoma cells. Any one of a number of methods well known in the art can be used to identify the hybridoma cell which produces an antibody with the desired characteristics. These include screening the hybridomas with an ELISA assay, Western blot analysis, or radioimmunoassay (Lutz et al., Exp. Cell Research. 175:109-124 (1988)). Hybridomas secreting the desired antibodies are cloned and the class and subclass is determined using procedures known in the art (Campbell, A. M., Monoclonal Antibody Technology: Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1984)). Techniques described for the production of single chain antibodies (U.S. Pat. 4,946,778) can be adapted to produce single chain antibodies to proteins of the present invention.

[0267] For polyclonal antibodies, antibody-containing antiserum is isolated from the immunized animal and is screened for the presence of antibodies with the desired specificity using one of the above-described procedures. The present invention further provides the above-described antibodies in delectably labeled form. Antibodies can be delectably labeled through the use of radioisotopes, affinity labels (such as biotin, avidin, etc.), enzymatic labels (such as horseradish peroxidase, alkaline phosphatase, etc.) fluorescent labels (such as FITC or rhodamine, etc.), paramagnetic atoms, etc. Procedures for accomplishing such labeling are well-known in the art, for example, see (Sternberger, L. A. et al., J. Histochem. Cytochem. 18:315 (1970); Bayer, E. A. et al., Meth. Enzym. 62:308 (1979); Engval, E. et al., Immunol. 109:129 (1972); Goding, J. W. J. Immunol. Meth. 13:215 (1976)).

[0268] The labeled antibodies of the present invention can be used for in vitro, in vivo, and in situ assays to identify cells or tissues in which a fragment of the polypeptide of interest is expressed. The antibodies may also be used directly in therapies or other diagnostics. The present invention further provides the above-described antibodies immobilized on a solid support. Examples of such solid supports include plastics such as polycarbonate, complex carbohydrates such as agarose and Sepharose®, acrylic resins and such as polyacrylamide and latex beads. Techniques for coupling antibodies to such solid supports are well known in the art (Weir, D. M. et al., “Handbook of Experimental Immunology” 4th Ed., Blackwell Scientific Publications, Oxford, England, Chapter 10 (1986); Jacoby, W. D. et al., Meth. Enzym. 34 Academic Press, N.Y. (1974)). The immobilized antibodies of the present invention can be used for in vitro, in vivo, and in situ assays as well as for immuno-affinity purification of the polypeptides or proteins of the present invention.

4.12. Computer Readable Sequences

[0269] In one application of this embodiment, a nucleotide sequence of the present invention can be recorded on computer readable media. As used herein, “computer readable media” refers to any medium which can be read and accessed directly by a computer. Such media include, but are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these categories such as magnetic/optical storage media. A skilled artisan can readily appreciate how any of the presently known computer readable mediums can be used to create a manufacture comprising computer readable medium having recorded thereon a nucleotide sequence of the present invention. As used herein, “recorded” refers to a process for storing information on computer readable medium. A skilled artisan can readily adopt any of the presently known methods for recording information on computer readable medium to generate manufactures comprising the nucleotide sequence information of the present invention.

[0270] A variety of data storage structures are available to a skilled artisan for creating a computer readable medium having recorded thereon a nucleotide sequence of the present invention. The choice of the data storage structure will generally be based on the means chosen to access the stored information. In addition, a variety of data processor programs and formats can be used to store the nucleotide sequence information of the present invention on computer readable medium. The sequence information can be represented in a word processing text file, formatted in commercially-available software such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can readily adapt any number of data processor structuring formats (e.g. text file or database) in order to obtain computer readable medium having recorded thereon the nucleotide sequence information of the present invention.

[0271] By providing any of the nucleotide sequences designated as SEQ ID NO: 1-55 or a representative fragment thereof; or a nucleotide sequence at least 99.9% identical to any of the nucleotide sequences of SEQ ID NO: 1-55 in computer readable form, a skilled artisan can routinely access the sequence information for a variety of purposes. Computer software is publicly available which allows a skilled artisan to access sequence information provided in a computer readable medium. The examples which follow demonstrate how software which implements the BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. Chem. 17:203-207 (1993)) search algorithms on a Sybase system is used to identify open reading frames (ORFs) within a nucleic acid sequence. Such ORFs may be protein encoding fragments and may be useful in producing commercially important proteins such as enzymes used in fermentation reactions and in the production of commercially useful metabolites.

[0272] As used herein, “a computer-based system” refers to the hardware means, software means, and data storage means used to analyze the nucleotide sequence information of the present invention. The minimum hardware means of the computer-based systems of the present invention comprises a central processing unit (CPU), input means, output means, and data storage means. A skilled artisan can readily appreciate that any one of the currently available computer-based systems are suitable for use in the present invention. As stated above, the computer-based systems of the present invention comprise a data storage means having stored therein a nucleotide sequence of the present invention and the necessary hardware means and software means for supporting and implementing a search means. As used herein, “data storage means” refers to memory which can store nucleotide sequence information of the present invention, or a memory access means which can access manufactures having recorded thereon the nucleotide sequence information of the present invention.

[0273] As used herein, “search means” refers to one or more programs which are implemented on the computer-based system to compare a target sequence or target structural motif with the sequence information stored within the data storage means. Search means are used to identify fragments or regions of a known sequence which match a particular target sequence or target motif. A variety of known algorithms are disclosed publicly and a variety of commercially available software for conducting search means are and can be used in the computer-based systems of the present invention. Examples of such software includes, but is not limited to, Smith-Waterman, MacPattern (EMBL), BLASTN and BLASTA (NPOLYPEPTIDEIA). A skilled artisan can readily recognize that any one of the available algorithms or implementing software packages for conducting homology searches can be adapted for use in the present computer-based systems. As used herein, a “target sequence” can be any nucleic acid or amino acid sequence of six or more nucleotides or two or more amino acids. A skilled artisan can readily recognize that the longer a target sequence is, the less likely a target sequence will be present as a random occurrence in the database. The most preferred sequence length of a target sequence is from about 10 to 100 amino acids or from about 30 to 300 nucleotide residues. However, it is well recognized that searches for commercially important fragments, such as sequence fragments involved in gene expression and protein processing, may be of shorter length.

[0274] As used herein, “a target structural motif,” or “target motif,” refers to any rationally selected sequence or combination of sequences in which the sequence(s) are chosen based on a three-dimensional configuration which is formed upon the folding of the target motif. There are a variety of target motifs known in the art. Protein target motifs include, but are not limited to, enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited to, promoter sequences, hairpin structures and inducible expression elements (protein binding sequences).

4.13. Triple Helix Formation

[0275] In addition, the fragments of the present invention, as broadly described, can be used to control gene expression through triple helix formation or antisense DNA or RNA, both of which methods are based on the binding of a polynucleotide sequence to DNA or RNA. Polynucleotides suitable for use in these methods are usually 20 to 40 bases in length and are designed to be complementary to a region of the gene involved in transcription (triple helix—see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 15241:456 (1988); and Dervan et al., Science 251:1360 (1991)) or to the mRNA itself (antisense—Olmno, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, Fla. (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into polypeptide. Both techniques have been demonstrated to be effective in model systems. Information contained in the sequences of the present invention is necessary for the design of an antisense or triple helix oligonucleotide.

4.14. Diagnostic Assays and Kits

[0276] The present invention further provides methods to identify the presence or expression of one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic acid probe or antibodies of the present invention, optionally conjugated or otherwise associated with a suitable label.

[0277] In general, methods for detecting a polynucleotide of the invention can comprise contacting a sample with a compound that binds to and forms a complex with the polynucleotide for a period sufficient to form the complex, and detecting the complex, so that if a complex is detected, a polynucleotide of the invention is detected in the sample. Such methods can also comprise contacting a sample under stringent hybridization conditions with nucleic acid primers that anneal to a polynucleotide of the invention under such conditions, and amplifying annealed polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is detected in the sample.

[0278] In general, methods for detecting a polypeptide of the invention can comprise contacting a sample with a compound that binds to and forms a complex with the polypeptide for a period sufficient to form the complex, and detecting the complex, so that if a complex is detected, a polypeptide of the invention is detected in the sample.

[0279] In detail, such methods comprise incubating a test sample with one or more of the antibodies or one or more of the nucleic acid probes of the present invention and assaying for binding of the nucleic acid probes or antibodies to components within the test sample.

[0280] Conditions for incubating a nucleic acid probe or antibody with a test sample vary. Incubation conditions depend on the format employed in the assay, the detection methods employed, and the type and nature of the nucleic acid probe or antibody used in the assay. One skilled in the art will recognize that any one of the commonly available hybridization, amplification or immunological assay formats can readily be adapted to employ the nucleic acid probes or antibodies of the present invention. Examples of such assays can be found in Chard, T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, Amsterdam, The Netherlands (1986); Bullock, G. R. et al., Techniques in Immunocytochemistry, Academic Press, Orlando, Fla. Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice and Theory of immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1985). The test samples of the present invention include cells, protein or membrane extracts of cells, or biological fluids such as sputum, blood, serum, plasma, or urine. The test sample used in the above-described method will vary based on the assay format, nature of the detection method and the tissues, cells or extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane extracts of cells are well known in the art and can be readily be adapted in order to obtain a sample which is compatible with the system utilized.

[0281] In another embodiment of the present invention, kits are provided which contain the necessary reagents to carry out the assays of the present invention. Specifically, the invention provides a compartment kit to receive, in close confinement, one or more containers which comprises: (a) a first container comprising one of the probes or antibodies of the present invention; and (b) one or more other containers comprising one or more of the following: wash reagents, reagents capable of detecting presence of a bound probe or antibody.

[0282] In detail, a compartment kit includes any kit in which reagents are contained in separate containers. Such containers include small glass containers, plastic containers or strips of plastic or paper. Such containers allows one to efficiently transfer reagents from one compartment to another compartment such that the samples and reagents are not cross-contaminated, and the agents or solutions of each container can be added in a quantitative fashion from one compartment to another. Such containers will include a container which will accept the test sample, a container which contains the antibodies used in the assay, containers which contain wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and containers which contain the reagents used to detect the bound antibody or probe. Types of detection reagents include labeled nucleic acid probes, labeled secondary antibodies, or in the alternative, if the primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of reacting with the labeled antibody. One skilled in the art will readily recognize that the disclosed probes and antibodies of the present invention can be readily incorporated into one of the established kit formats which are well known in the art.

4.15. Medical Imaging

[0283] The novel polypeptides and binding partners of the invention are useful in medical imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the invention is involved in the immune response, for imaging sites of inflammation or infection). See, e.g., Kunkel et al., U.S. Pat. NO. 5,413,778. Such methods involve chemical attachment of a labeling or imaging agent, administration of the labeled polypeptide to a subject in a pharmaceutically acceptable carrier, and imaging the labeled polypeptide in vivo at the target site.

4.16 Screening Assays

[0284] Using the isolated polypeptides or proteins and polynucleotides of the invention, the present invention further provides methods of obtaining and identifying agents which bind to a polypeptide encoded by an ORF corresponding to any of the nucleotide sequences designated as SEQ ID NO: 1-55, or bind to a specific domain of the polypeptide encoded by the nucleic acid. In detail, said method comprises the steps of:

[0285] (a) contacting an agent with an isolated protein encoded by an ORP of the present invention, or nucleic acid of the invention; and

[0286] (b) determining whether the agent binds to said protein or said nucleic acid.

[0287] In general, therefore, such methods for identifying compounds that bind to a polynucleotide of the invention can comprise contacting a compound with a polynucleotide of the invention for a time sufficient to form a polynucleotide/compound complex, and detecting the complex, so that if a polynucleotide/compound complex is detected, a compound that binds to a polynucleotide of the invention is identified.

[0288] Likewise, in general, therefore, such methods for identifying compounds that bind to a polypeptide of the invention can comprise contacting a compound with a polypeptide of the invention for a time sufficient to form a polypeptide/compound complex, and detecting the complex, so that if a polypeptide/compound complex is detected, a compound that binds to a polynucleotide of the invention is identified.

[0289] Methods for identifying compounds that bind to a polypeptide of the invention can also comprise contacting a compound with a polypeptide of the invention in a cell for a time sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a receptor gene sequence in the cell, and detecting the complex by detecting reporter gene sequence expression, so that if a polypeptide/compound complex is detected, a compound that binds a polypeptide of the invention is identified.

[0290] Compounds identified via such methods can include compounds which modulate the activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to activity observed in the absence of the compound). Alternatively, compounds identified via such methods can include compounds which modulate the expression of a polynucleotide of the invention (that is, increase or decrease expression relative to expression levels observed in the absence of the compound). Compounds, such as compounds identified via the methods of the invention, can be tested using standard assays well known to those of skill in the art for their ability to modulate activity/expression.

[0291] The agents screened in the above assay can be, but are not limited to, peptides, carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected and screened at random or rationally selected or designed using protein modeling techniques.

[0292] For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and the like are selected at random and are assayed for their ability to bind to the protein encoded by the ORF of the present invention. Alternatively, agents may be rationally selected or designed. As used herein, an agent is said to be “rationally selected or designed” when the agent is chosen based on the configuration of the particular protein. For example, one skilled in the art can readily adapt currently available procedures to generate peptides, pharmaceutical agents and the like capable of binding to a specific peptide sequence in order to generate rationally designed antipeptide peptides, for example see Hurby et al., Application of Synthetic Peptides: Antisense Peptides, “In Synthetic Peptides, A User's Guide, W. H. Freeman, N.Y. (1992), pp. 289-307, and Kaspczak et al., Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like.

[0293] In addition to the foregoing, one class of agents of the present invention, as broadly described, can be used to control gene expression through binding to one of the ORFs or EMFs of the present invention. As described above, such agents can be randomly screened or rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design sequence specific or element specific agents, modulating the expression of either a single ORF or multiple ORFs which rely on the same EMF for expression control. One class of DNA binding agents are agents which contain base residues which hybridize or form a triple helix formation by binding to DNA or RNA. Such agents can be based on the classic phosphodiester, ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric derivatives which have base attachment capacity.

[0294] Agents suitable for use in these methods usually contain 20 to 40 bases and are designed to be complementary to a region of the gene involved in transcription (triple helix—see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et al., Science 251:1360 (1991)) or to the mRNA itself (antisense—Okano, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, Fla. (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into polypeptide. Both techniques have been demonstrated to be effective in model systems. Information contained in the sequences of the present invention is necessary for the design of an antisense or triple helix oligonucleotide and other DNA binding agents. Agents which bind to a protein encoded by one of the ORFs of the present invention can be used as a diagnostic agent, in the control of bacterial infection by modulating the activity of the protein encoded by the ORF. Agents which bind to a protein encoded by one of the ORFs of the present invention can be formulated using known techniques to generate a pharmaceutical composition.

4.17. Use of Nucleic Acids as Probes

[0295] Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid hybridization probes capable of hybridizing with naturally occurring nucleotide sequences. The hybridization probes of the subject invention may be derived from any of the nucleotide sequences designated as SEQ ID NO: 1-55. Because the corresponding gene is only expressed in a limited number of tissues, a hybridization probe derived from of any of the nucleotide sequences designated as SEQ ID NO: 1-55 can be used as an indicator of the presence of RNA of cell type of such a tissue in a sample.

[0296] Any suitable hybridization technique can be employed, such as, for example, in situ hybridization. PCR as described in U.S. Pat. Nos. 4,683,195 and 4,965,188 provides additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used in PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. The probe will comprise a discrete nucleotide sequence for the detection of identical sequences or a degenerate pool of possible sequences for identification of closely related genomic sequences.

[0297] Other means for producing specific hybridization probes for nucleic acids include the cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such vectors are known in the art and are commercially available and may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA polymerase as T7 or SP6 RNA polymerase and the appropriate radioactively labeled nucleotides. The nucleotide sequences may be used to construct hybridization probes for mapping their respective genomic sequences. The nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a chromosome using well known genetic and/or chromosomal mapping techniques. These techniques include in situ hybridization, linkage analysis against known chromosomal markers, hybridization screening with libraries or flow-sorted chromosomal preparations specific to known chromosomes, and the like. The technique of fluorescent in situ hybridization of chromosome spreads has been described, among other places, in Verma et al (1988) Human Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York N.Y.

[0298] Fluorescent in situ hybridization of chromosomal preparations and other physical chromosome mapping techniques may be correlated with additional genetic map data. Examples of genetic map data can be found in the 1994 Genome Issue of Science (265:1981f). Correlation between the location of a nucleic acid on a physical chromosomal map and a specific disease (or predisposition to a specific disease) may help delimit the region of DNA associated with that genetic disease. The nucleotide sequences of the subject invention may be used to detect differences in gene sequences between normal, carrier or affected individuals. The nucleotide sequence may be used to produce purified polypeptides using well known methods of recombinant DNA technology. Among the many publications that teach methods for the expression of genes after they have been isolated is Goeddel (1990) Gene Expression Technology, Methods and Enzymology, Vol 185, Academic Press, San Diego. Polypeptides may be expressed in a variety of host cells, either prokaryotic or eukaryotic. Host cells may be from the same species from which a particular polypeptide nucleotide sequence was isolated or from a different species. Advantages of producing polypeptides by recombinant DNA technology include obtaining adequate amounts of the protein for purification and the availability of simplified purification procedures.

4.18 Preparation of Sequencing Chips and Arrays

[0299] A basic example is using 6-mers attached to 50 micron surfaces to give a chip with dimensions of 3×3 mm which can be combined to give an array of 20×20 cm. Another example is using 9-mer oligonucleotides attached to 10×10 microns surface to create a 9-mer chip, with dimensions of 5×5 mm. 4000 units of such chips may be used to create a 30×30 cm array. In an array in which 4,000 to 16,000 oligochips are arranged into a square array. A plate, or collection of tubes, as also depicted, may be packaged with the array as part of the sequencing kit.

[0300] The arrays may be separated physically from each other or by hydrophobic surfaces. One possible way to utilize the hydrophobic strip separation is to use technology such as the Iso-Grid Microbiology System produced by QA Laboratories, Toronto, Canada.

[0301] Hydrophobic grid membrane filters (HGMF) have been in use in analytical food microbiology for about a decade where they exhibit unique attractions of extended numerical range and automated counting of colonies. One commercially-available grid is ISO-GRID™ from QA Laboratories Ltd. (Toronto, Canada) which consists of a square (60×60 cm) of polysulfone polymer (Gelman Tuffryn HT450, 0.45u pore size) on which is printed a black hydrophobic ink grid consisting of 1600 (40×40) square cells. HGMF have previously been inoculated with bacterial suspensions by vacuum filtration and incubated on the differential or selective media of choice.

[0302] Because the microbial growth is confined to grid cells of known position and size on the membrane, the HGMF functions more like an MPN apparatus than a conventional plate or membrane filter. Peterkin et al. (1987) reported that these HGMFs can be used to propagate and store genomic libraries when used with a HGMF replicator. One such instrument replicates growth from each of the 1600 cells of the ISO-GRID and enables many copies of the master HGMF to be made (Peterkin et al., 1987).

[0303] Sharpe et al. (1989) also used ISO-GRID HGMF form QA Laboratories and an automated HGMF counter (MI-100 Interpreter) and RP-100 Replicator. They reported a technique for maintaining and screening many microbial cultures.

[0304] Peterkin and colleagues later described a method for screening DNA probes using the hydrophobic grid-membrane filter (Peterkin et al., 1989). These authors reported methods for effective colony hybridization directly on HGMFs. Previously, poor results had been obtained due to the low DNA binding capacity of the epoxysulfone polymer on which the HGMFs are printed. However, Peterkin et al. (1989) reported that the binding of DNA to the surface of the membrane was improved by treating the replicated and incubated HGMF with polyethyleneimine, a polycation, prior to contact with DNA. Although this early work uses cellular DNA attachment, and has a different objective to the present invention, the methodology described may be readily adapted for Format 3 SBH.

[0305] In order to identify useful sequences rapidly, Peterkin et al. (1989) used radiolabeled plasmid DNA from various clones and tested its specificity against the DNA on the prepared HGMFs. In this way, DNA from recombinant plasmids was rapidly screened by colony hybridization against 100 organisms on HGMF replicates which can be easily and reproducibly prepared.

[0306] Manipulation with small (2-3 mm) chips, and parallel execution of thousands of the reactions. The solution of the invention is to keep the chips and the probes in the corresponding arrays. In one example, chips containing 250,000 9-mers are synthesized on a silicon wafer in the form of 8×8 mM plates (15 uM/oligonucleotide, Pease et al., 1994) arrayed in 8×12 format (96 chips) with a 1 mM groove in between. Probes are added either by multichannel pipette or pin array, one probe on one chip. To score all 4000 6-mers, 42 chip arrays have to be used, either using different ones, or by reusing one set of chip arrays several times.

[0307] In the above case, using the earlier nomenclature of the application, F=9; P=6; and F+P=15. Chips may have probes of formula BxNn, where x is a number of specified bases B; and n is a number of non-specified bases, so that x=4 to 10 and n=1 to 4. To achieve more efficient hybridization, and to avoid potential influence of any support oligonucleotides, the specified bases can be surrounded by unspecified bases, thus represented by a formula such as (N)nBx(N)m.

4.19 Preparation of Support Bound Oligonucleotides

[0308] Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, for example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced using an automated oligonucleotide synthesizer.

[0309] Support bound oligonucleotides may be prepared by any of the methods known to those of skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is to precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can be achieved using passive adsorption (Inouye & Hondo, 1990); using UV light (Nagata et al., 1985; Dahlen et al., 1987; Morriey & Collins, 1989) or by covalent binding of base modified DNA (Keller et al., 1988; 1989); all references being specifically incorporated herein.

[0310] Another strategy that may be employed is the use of the strong biotin-streptavidin interaction as a linker. For example, Broude et al. (1994) describe the use of Biotinylated probes, although these are duplex probes, that are immobilized on streptavidin-coated magnetic beads. Streptavidin-coated beads may be purchased from Dynal, Oslo. Of course, this same linking chemistry is applicable to coating any surface with streptavidin. Biotinylated probes may be purchased from various sources, such as, e.g., Operon Technologies (Alameda, Calif.).

[0311] Nunc Laboratories (Naperville, Ill.) is also selling suitable material that could be used. Nunc Laboratories have developed a method by which DNA can be covalently bound to the microwell surface termed Covalink NH. CovaLink NH is a polystyrene surface grafted with secondary amino groups (>NH) that serve as bridge-heads for further covalent coupling. CovaLink Modules may be purchased from Nunc Laboratories. DNA molecules may be bound to CovaLink exclusively at the 5′-end by a phosphoramidate bond, allowing immobilization of more than 1 pmol of DNA (Rasmussen et al., 1991).

[0312] The use of CovaLink NH strips for covalent binding of DNA molecules at the 5′-end has been described (Rasmussen et al., 1991). In this technology, a phosphoramidate bond is employed (Chu et al., 1983). This is beneficial as immobilization using only a single covalent bond is preferred. The phosphoramidate bond joins the DNA to the CovaLink NH secondary amino groups that are positioned at the end of spacer arms covalently grafted onto the polystyrene surface through a 2 nm long spacer arm. To link an oligonucleotide to CovaLink NH via an phosphoramidate bond, the oligonucleotide terminus must have a 5′-end phosphate group. It is, perhaps, even possible for biotin to be covalently bound to CovaLink and then streptavidin used to bind the probes.

[0313] More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) and denaturing for 10 min. at 95° C. and cooling on ice for 10 min. Icecold 0.1 M 1-methylimidazole, pH 7.0 (1-MeIm₇), is then added to a final concentration of 10 mM 1-MeIn₇. A ss DNA solution is then dispensed into CovaLink NH strips (75 ul/well) standing on ice.

[0314] Carbodiimide 0.2 M 1-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), dissolved in 10 mM 1-Melm₇, is made fresh and 25 ul added per well. The strips are incubated for 5 hours at 50° C. After incubation the strips are washed using, e.g., Nunc-Immuno Wash; first the wells are washed 3 times, then they are soaked with washing solution for 5 min., and finally they are washed 3 times (where in the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50° C.).

[0315] It is contemplated that a further suitable method for use with the present invention is that described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated herein by reference. This method of preparing an oligonucleotide bound to a support involves attaching a nucleoside 3′-reagent through the phosphate group by a covalent phosphodiester link to aliphatic hydroxyl groups carried by the support. The oligonucleotide is then synthesized on the supported nucleoside and protecting groups removed from the synthetic oligonucleotide chain under standard conditions that do not cleave the oligonucleotide from the support. Suitable reagents include nucleoside phosphoramidite and nucleoside hydrogen phosphorate.

[0316] An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe arrays may be employed. For example, addressable laser-activated photodeprotection may be employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described by Fodor et al. (1991), incorporated herein by reference. Probes may also be immobilized on nylon supports as described by Van Ness et al. (1991); or linked to Teflon using the method of Duncan & Cavalier (1988); all references being specifically incorporated herein.

[0317] To link an oligonucleotide to a nylon support, as described by Van Ness et al. (1991), requires activation of the nylon surface via alkylation and selective activation of the 5′-amine of oligonucleotides with cyanuric chloride.

[0318] One particular way to prepare support bound oligonucleotides is to utilize the light-generated synthesis described by Pease et al., (1994, incorporated herein by reference). These authors used current photolithographic techniques to generate arrays of immobilized oligonucleotide probes (DNA chips). These methods, in which light is used to direct the synthesis of oligonucleotide probes in high-density, miniaturized arrays, utilize photolabile 5′-protected N-acyl-deoxynucleoside phosphoramidites, surface linker chemistry and versatile combinatorial synthesis strategies. A matrix of 256 spatially defined oligonucleotide probes may be generated in this manner and then used in the advantageous Format 3 sequencing, as described herein.

4.20 Preparation of Nucleic Acid Fragments

[0319] The nucleic acids to be sequenced may be obtained from any appropriate source, such as cDNAs, genomic DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC inserts, and RNA, including mRNA without any amplification steps. For example, Sambrook et al. (1989) describes three protocols for the isolation of high molecular weight DNA from mammalian cells (p. 9.14-9.23).

[0320] DNA fragments may be prepared as clones in M13, plasmid or lambda vectors and/or prepared directly from genomic DNA or cDNA by PCR or other amplification methods. Samples may be prepared or dispensed in multiwell plates. About 100-1000 ng of DNA samples may be prepared in 2-500 ml of final volume.

[0321] The nucleic acids would then be fragmented by any of the methods known to those of skill in the art including, for example, using restriction enzymes as described at 9.24-9.28 of Sambrook et al. (1989), shearing by ultrasound and NaOH treatment.

[0322] Low pressure shearing is also appropriate, as described by Schriefer et al. (1990, incorporated herein by reference). In this method, DNA samples are passed through a small French pressure cell at a variety of low to intermediate pressures. A lever device allows controlled application of low to intermediate pressures to the cell. The results of these studies indicate that low-pressure shearing is a useful alternative to sonic and enzymatic DNA fragmentation methods.

[0323] One particularly suitable way for fragmenting DNA is contemplated to be that using the two base recognition endonuclease, CviJI, described by Fitzgerald et al. (1992). These authors described an approach for the rapid fragmentation and fractionation of DNA into particular sizes that they contemplated to be suitable for shotgun cloning and sequencing. The present inventor envisions that this will also be particularly useful for generating random, but relatively small, fragments of DNA for use in the present sequencing technology.

[0324] The restriction endonuclease CviJI normally cleaves the recognition sequence PuGCPy between the G and C to leave blunt ends. Atypical reaction conditions, which alter the specificity of this enzyme (CviJI**), yield a quasi-random distribution of DNA fragments form the small molecule pUC19 (2688 base pairs). Fitzgerald et al. (1992) quantitatively evaluated the randomness of this fragmentation strategy, using a CviJI** digest of pUC19 that was size fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z minus M13 cloning vector. Sequence analysis of 76 clones showed that CviJI** restricts pyGCPy and PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated at a rate consistent with random fragmentation.

[0325] As reported in the literature, advantages of this approach compared to sonication and agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 ug instead of 2-5 ug); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel electrophoresis and elution are needed). These advantages are also proposed to be of use when preparing DNA for sequencing by Format 3.

[0326] Irrespective of the manner in which the nucleic acid fragments are obtained or prepared, it is important to denature the DNA to give single stranded pieces available for hybridization. This is achieved by incubating the DNA solution for 2-5 minutes at 80-90° C. The solution is then cooled quickly to 2° C. to prevent renaturation of the DNA fragments before they are contacted with the chip. Phosphate groups must also be removed from genomic DNA by methods known in the art.

4.21 Preparation of DNA Arrays

[0327] Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane. Spotting may be performed by using arrays of metal pins (the positions of which correspond to an array of wells in a microtiter plate) to repeated by transfer of about 20 nl of a DNA solution to a nylon membrane. By offset printing, a density of dots higher than the density of the wells is achieved. One to 25 dots may be accommodated in 1 mm², depending on the type of label used. By avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays) may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same gene) from different individuals, or may be different, overlapped genomic clones. Each of the subarrays may represent replica spotting of the same samples. In one example, a selected gene segment may be amplified from 64 patients. For each patient, the amplified gene segment may be in one 96-well plate (all 96 wells containing the same sample). A plate for each of the 64 patients is prepared. By using a 96-pin device, all samples may be spotted on one 8×12 cm membrane. Subarrays may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the dot span may be 1 mm² and there may be a 1 mm space between subarrays.

[0328] Another approach is to use membranes or plates (available from NUNC, Naperville, Ill.) which may be partitioned by physical spacers e.g. a plastic grid molded over the membrane, the grid being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic strips. A fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage screens or x-ray films.

[0329] The present invention is illustrated in the following examples. Upon consideration of the present disclosure, one of skill in the art will appreciate that many other embodiments and variations may be made in the scope of the present invention. Accordingly, it is intended that the broader aspects of the present invention not be limited to the disclosure of the following examples. The present invention is not to be limited in scope by the exemplified embodiments which are intended as illustrations of single aspects of the invention, and compositions and methods which are functionally equivalent are within the scope of the invention. Indeed, numerous modifications and variations in the practice of the invention are expected to occur to those skilled in the art upon consideration of the present preferred embodiments. Consequently, the only limitations which should be placed upon the scope of the invention are those which appear in the appended claims.

[0330] All references cited within the body of the instant specification are hereby incorporated by reference in their entirety.

EXAMPLES Example 1 Novel Nucleic Acid Sequences Obtained from Various Libraries

[0331] A plurality of novel nucleic acids were obtained from cDNA libraries prepared from various human tissues and in some cases isolated from a genomic library derived from human chromosome using standard PCR, SBH sequence signature analysis and Sanger sequencing techniques. The inserts of the library were amplified with PCR using primers specific for the vector sequences which flank the inserts. Clones from cDNA libraries were spotted on nylon membrane filters and screened with oligonucleotide probes (e.g., 7-mers) to obtain signature sequences. The clones were clustered into groups of similar or identical sequences. Representative clones were selected for sequencing.

[0332] In some cases, the 5′ sequence of the amplified inserts was then deduced using a typical Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In some cases RACE (Random Amplification of cDNA Ends) was performed to further extend the sequence in the 5′ direction.

Example 2 Novel Nucleic Acids

[0333] The novel nucleic acids of the present invention of the invention were assembled from sequences that were obtained from a cDNA library by methods described in Example 1 above, and in some cases sequences obtained from one or more public databases. The nucleic acids were assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend the seed EST into an extended assemblage, by pulling additional sequences from different databases (i.e., Hyseq's database containing EST sequences, dbEST version 114, gb pri 114, and UniGene version 101) that belong to this assemblage. The algorithm terminated when there was no additional sequences from the above databases that would extend the assemblage. Inclusion of component sequences into the assemblage was based on a BLASTN hit to the extending assemblage with BLAST score greater than 300 and percent identity greater than 95%.

[0334] The nearest neighbor results for SEQ ID NO: 1-55 were obtained by a FASTA version 3 search against Genpept release 117, using FASTXY algorithm. FASTXY is an improved version of FASTA alignment which allows in-codon frame shifts. The nearest neighbor result showed the closest homologue for SEQ ID NO: 1-55 from Genpept (and contains the translated amino acid sequences for which the nucleic acid sequence encodes). The nearest neighbor results for SEQ ID NO: 1-55 are shown in Table 1 below. TABLE 1 CORRESPONDING SEQ ID NO. IN PRIORITY SEQ ID NO. APPLICATION OF CUR- ATTORNEY SMITH- RENT AP- DOCKET NO. ACCESSION WATERMAN % PLICATION 789 CIP NUMBER DESCRIPTION SCORE IDENTITY 1 4512 M12987 Plasmid F Protein 2226 95.376 A 2 4541 D88010 Homo sapiens 978 100.000 ribosomal protein S13 3 4581 M81757 Homo sapiens S19 957 100.000 ribosomal protein 4 4718 AB021642 Homo sapiens 4266 99.660 gonadotropin inducible transcription repressor-2 5 4727 C13905 Rattus sp. rab1B 1286 98.010 protein (AA 1- 201) 6 4742 A36559 unidentified 1815 100.000 RECEPTOR GR33 DE RAT 7 4760 AF007151 Homo sapiens 2214 100.000 unknown 8 4956 X06323 Homo sapiens put. 2438 100.000 ribosomal protein L3 (AA 1-348) 9 4998 L07493 Homo sapiens 594 100.000 replication protein A 14 kDa subunit 10 5046 AF197927 Homo sapiens 7637 99.914 AF5q31 protein 11 5247 X52318 Bos taurus 802 100.000 histone H2A.Z (AA 1-127) 12 5271 D38549 Homo sapiens 4482 100.000 ha1025 is new 13 5290 Z34975 Homo sapiens 4765 100.000 ldlCp 14 5463 AF096895 Homo sapiens 646 100.000 chemokine-like factor 1 15 5463 AF135380 Homo sapiens 989 100.000 chemokine-like factor 2 16 5507 AK001490 Homo sapiens 998 90.184 unnamed protein product 17 5522 L12693 Mus sp. nucleic 1298 100.000 acid binding protein 18 5522 M28372 Homo sapiens SRE- 1339 99.435 binding protein 19 5538 X87373 Homo sapiens 1426 100.000 ribosomal protein S3a 20 5556 AF053628 Mus musculus 1997 88.623 D3Mm3e 21 5556 AF053628 Mus musculus 1997 88.623 D3Mm3e 22 5556 AF053628 Mus musculus 1997 88.623 D3Mm3e 23 5596 AF113124 Homo sapiens FEZ2 1433 91.083 protein 24 5596 U64689 Rattus norvegicus 1489 94.156 zyginII 25 5618 J03075 Homo sapiens 80K- 3621 100.000 H protein 26 5618 J03075 Homo sapiens 80K- 3621 100.000 H protein 27 5618 J03075 Homo sapiens 80K- 3621 100.000 H protein 28 5622 M27024 Homo sapiens heat 2469 95.782 shock protein 29 5625 U36764 Homo sapiens TGF- 1349 84.615 beta receptor interacting protein 1 30 5625 U36764 Homo sapiens TGF- 1740 99.692 beta receptor interacting protein 1 31 5638 X14723 Homo sapiens SP- 2447 100.000 40,40 prepropeptide (AA −22 to 427) 32 5649 D30648 Homo sapiens 4023 92.620 flavoprotein subunit of complex II 33 5649 D30648 Homo sapiens 4023 92.620 flavoprotein subunit of complex II 34 5650 D45887 Homo sapiens 959 100.000 calmodulin 35 5651 AK000374 Homo sapiens 1070 100.000 unnamed protein product 36 5655 AL035462 Homo sapiens 1103 100.000 dJ995J12.1 (similar to ganglioside- induced differentiation associated protein 1) 37 5665 AC004472 Homo sapiens 5221 98.160 TERA_HUMAN 38 5698 U61837 Homo sapiens 953 100.000 putative cyclin G1 interacting protein 39 5699 U20324 Homo sapiens LIM 1163 100.000 domain protein 40 5717 AF182844 Homo sapiens 1468 100.000 VPS28 protein 41 5721 V00566 Homo sapiens 1513 100.000 reading frame prolactin 42 5723 AE003681 Drosophila 1968 61.368 melanogaster CG11964 gene product 43 5741 V00572 Homo sapiens 2717 100.000 coding sequence 44 5744 D00068 Homo sapiens (2′- 2017 100.000 5′) oligoadenylate synthetase 45 5751 J03275 Bos taurus ADP- 968 100.000 ribosylation factor 46 5752 X98248 Homo sapiens 5677 100.000 sortilin 47 5759 S78986 Homo sapiens 999 100.000 transcription regulator helix- loop-helix protein=Idl Idl-a 48 5761 AF047470 Homo sapiens 2171 100.000 malate dehydrogenase precursor 49 5903 AE003598 Drosophila 451 50 melanogaster CG7369 gene product 50 5910 AB020626 Homo sapiens 6534 99.898 KIAA0819 protein 51 5914 AK000214 Homo sapiens 2385 91.392 unnamed protein product 52 5936 M15885 Homo sapiens 813 100.000 seminal plasma protein precursor 53 5941 X55110 Homo sapiens 1019 100.000 neurite outgrowth- promoting protein 54 6251 D29958 Homo sapiens 1897 99.655 KIAA0116 55 6370 U22456 Homo sapiens 184 36.047 AMP-activated pro- tein kinase homolog

[0335]

1 55 1 1414 DNA Homo sapiens CDS (141)..(1217) 1 tatataaagg gggaggatgg gttttcttct ccttggaagg gggttctatt taaccacttg 60 cggtttttta aaccttgcga ttggttgtcc tttcccatta aacgcaaatt taataaaaaa 120 ccccaaacca tttttaaagg atg ttc aga atg aat ttc atg aag ccc tta 170 Met Phe Arg Met Asn Phe Met Lys Pro Leu 1 5 10 acc cag tgc ata acc gtt ggt cat gaa atg tcg aag gct atc gcc att 218 Thr Gln Cys Ile Thr Val Gly His Glu Met Ser Lys Ala Ile Ala Ile 15 20 25 gcg cag ttt aat gat gac agc ccg aaa gcg agg aaa ata ccc cgg cgc 266 Ala Gln Phe Asn Asp Asp Ser Pro Lys Ala Arg Lys Ile Pro Arg Arg 30 35 40 tgg aga ata ggt aaa gca gcg gat tta gtt ggg gtt tct tct cag gct 314 Trp Arg Ile Gly Lys Ala Ala Asp Leu Val Gly Val Ser Ser Gln Ala 45 50 55 atc aga gat gcc gag aaa gca ggg cga ctc ccg cac ccg gat atg gaa 362 Ile Arg Asp Ala Glu Lys Ala Gly Arg Leu Pro His Pro Asp Met Glu 60 65 70 att cga gga cgg gtt gag caa cgt gtt ggt tat aca att gaa caa att 410 Ile Arg Gly Arg Val Glu Gln Arg Val Gly Tyr Thr Ile Glu Gln Ile 75 80 85 90 aat cat atg cgt cat gtg ttt ggt ccg cga ttg cag cgt gct gaa aac 458 Asn His Met Arg His Val Phe Gly Pro Arg Leu Gln Arg Ala Glu Asn 95 100 105 gta ttt cca ccg gtg ctc ggg gtt gct gcc cat aaa ggt ggc gtt tac 506 Val Phe Pro Pro Val Leu Gly Val Ala Ala His Lys Gly Gly Val Tyr 110 115 120 aaa acc tca gtt tct gtt cat ctt gct cag gat ctg gct ctg aag ggg 554 Lys Thr Ser Val Ser Val His Leu Ala Gln Asp Leu Ala Leu Lys Gly 125 130 135 cta cgt gtt ttg ctc gtg gaa ggt aac gac ccc cag gga aca gcc tca 602 Leu Arg Val Leu Leu Val Glu Gly Asn Asp Pro Gln Gly Thr Ala Ser 140 145 150 atg tat cac gga tgg gta cca gat ctt cat att cat gca gaa gac act 650 Met Tyr His Gly Trp Val Pro Asp Leu His Ile His Ala Glu Asp Thr 155 160 165 170 ctc ctg cct ttc tat ctt ggg gaa aag gac gat gtc act tat gca ata 698 Leu Leu Pro Phe Tyr Leu Gly Glu Lys Asp Asp Val Thr Tyr Ala Ile 175 180 185 aag ccc act tgc tgg ccg ggg ctt gac att att cct tcc tgt ctg gct 746 Lys Pro Thr Cys Trp Pro Gly Leu Asp Ile Ile Pro Ser Cys Leu Ala 190 195 200 ctg cac cgt att gaa act gag tta atg ggc aaa ttt gat gaa ggt aaa 794 Leu His Arg Ile Glu Thr Glu Leu Met Gly Lys Phe Asp Glu Gly Lys 205 210 215 ctg ccc acc gat cca cac ctg atg ctc cga ctg gcc att gaa act gtt 842 Leu Pro Thr Asp Pro His Leu Met Leu Arg Leu Ala Ile Glu Thr Val 220 225 230 gct cat gac tat gat gtc ata gtt att gac agc gcg cct aac ctg ggt 890 Ala His Asp Tyr Asp Val Ile Val Ile Asp Ser Ala Pro Asn Leu Gly 235 240 245 250 atc ggc acg att aat gtc gta tgt gct gct gat gtg ctg att gtt ccc 938 Ile Gly Thr Ile Asn Val Val Cys Ala Ala Asp Val Leu Ile Val Pro 255 260 265 acg cct gct gag ttg ttt gac tac acc tcc gca ctg cag ttt ttc gat 986 Thr Pro Ala Glu Leu Phe Asp Tyr Thr Ser Ala Leu Gln Phe Phe Asp 270 275 280 atg ctt cgt gat ctg ctc aag aac gtt gat ctt aaa ggg ttc gag cct 1034 Met Leu Arg Asp Leu Leu Lys Asn Val Asp Leu Lys Gly Phe Glu Pro 285 290 295 gat gta cgt att ttg ctt acc aaa tac agc aat agt aat ggc tct cag 1082 Asp Val Arg Ile Leu Leu Thr Lys Tyr Ser Asn Ser Asn Gly Ser Gln 300 305 310 tcc ccg tgg atg gag gag caa att cgg gat gcc tgg gga agc atg gtt 1130 Ser Pro Trp Met Glu Glu Gln Ile Arg Asp Ala Trp Gly Ser Met Val 315 320 325 330 cta aaa aat gtt gta cgt gaa acg gat gaa gtt ggt aaa ggt cag atc 1178 Leu Lys Asn Val Val Arg Glu Thr Asp Glu Val Gly Lys Gly Gln Ile 335 340 345 aac aat cat cgc gct acc gga ggc aat ttc cgg ttg tga agcggtggta 1227 Asn Asn His Arg Ala Thr Gly Gly Asn Phe Arg Leu * 350 355 gcggctgccg ttttcgcaac ggcttgcggt gcaaaaggca cagccagcat cagggccagg 1287 ctaaataaag aaactcgaaa tttcggcatg atgagcattc agatagtggt tcacgcgcac 1347 gggttgcgca ccgccggagt aaggatttac tgaggctagc gacgccatca taacgagcaa 1407 aaagtgc 1414 2 555 DNA Homo sapiens CDS (49)..(504) 2 gtaccggtcc ggaattcccg ggtcgaccca cgcgtccgcg ccgccatc atg ggt cgc 57 Met Gly Arg 1 atg cat gct ccc ggg aag ggc ctg tcc cag tcg gct tta ccc tat cga 105 Met His Ala Pro Gly Lys Gly Leu Ser Gln Ser Ala Leu Pro Tyr Arg 5 10 15 cgc agc gtc ccc act tgg ttg aag ttg aca tct gac gac gtg aag gag 153 Arg Ser Val Pro Thr Trp Leu Lys Leu Thr Ser Asp Asp Val Lys Glu 20 25 30 35 cag att tac aaa ctg gcc aag aag ggc ctt act cct tca cag atc ggt 201 Gln Ile Tyr Lys Leu Ala Lys Lys Gly Leu Thr Pro Ser Gln Ile Gly 40 45 50 gta atc ctg aga gat tca cat ggt gtt gca caa gta cgt ttt gtg aca 249 Val Ile Leu Arg Asp Ser His Gly Val Ala Gln Val Arg Phe Val Thr 55 60 65 ggc aat aaa att tta aga att ctt aag tct aag gga ctt gct cct gat 297 Gly Asn Lys Ile Leu Arg Ile Leu Lys Ser Lys Gly Leu Ala Pro Asp 70 75 80 ctt cct gaa gat ctc tac cat tta att aag aaa gca gtt gct gtt cga 345 Leu Pro Glu Asp Leu Tyr His Leu Ile Lys Lys Ala Val Ala Val Arg 85 90 95 aag cat ctt gag agg aac aga aag gat aag gat gct aaa ttc cgt ctg 393 Lys His Leu Glu Arg Asn Arg Lys Asp Lys Asp Ala Lys Phe Arg Leu 100 105 110 115 att cta ata gag agc cgg att cac cgt ttg gct cga tat tat aag acc 441 Ile Leu Ile Glu Ser Arg Ile His Arg Leu Ala Arg Tyr Tyr Lys Thr 120 125 130 aag cga gtc ctc cct ccc aat tgg aaa tat gaa tca tct aca gcc tct 489 Lys Arg Val Leu Pro Pro Asn Trp Lys Tyr Glu Ser Ser Thr Ala Ser 135 140 145 gcc ctg gtc gca taa atttgtctgt gtactcaagc aataaaatga ttgtttaact 544 Ala Leu Val Ala * 150 aaaaaaaaaa a 555 3 580 DNA Homo sapiens CDS (94)..(531) 3 attgaagccg tacgcctgca ggtaccggtc cggaattccc gggtcgaccc acgcgtccgc 60 ggacgcgtgg gctggcagcg cggaggccgc acg atg cct gga gtt act gta aaa 114 Met Pro Gly Val Thr Val Lys 1 5 gac gtg aac cag cag gag ttc gtc aga gct ctg gca gcc ttc ctc aaa 162 Asp Val Asn Gln Gln Glu Phe Val Arg Ala Leu Ala Ala Phe Leu Lys 10 15 20 aag tcc ggg aag ctg aaa gtc ccc gaa tgg gtg gat acc gtc aag ctg 210 Lys Ser Gly Lys Leu Lys Val Pro Glu Trp Val Asp Thr Val Lys Leu 25 30 35 gcc aag cac aaa gag ctt gct ccc tac gat gag aac tgg ttc tac acg 258 Ala Lys His Lys Glu Leu Ala Pro Tyr Asp Glu Asn Trp Phe Tyr Thr 40 45 50 55 cga gct gct tcc aca gcg cgg cac ctg tac ctc cgg ggt ggc gct ggg 306 Arg Ala Ala Ser Thr Ala Arg His Leu Tyr Leu Arg Gly Gly Ala Gly 60 65 70 gtt ggc tcc atg acc aag atc tat ggg gga cgt cag aga aac ggc gtc 354 Val Gly Ser Met Thr Lys Ile Tyr Gly Gly Arg Gln Arg Asn Gly Val 75 80 85 atg ccc agc cac ttc agc cga ggc tcc aag agt gtg gcc cgc cgg gtc 402 Met Pro Ser His Phe Ser Arg Gly Ser Lys Ser Val Ala Arg Arg Val 90 95 100 ctc caa gcc ctg gag ggg ctg aaa atg gtg gaa aag gac caa gat ggc 450 Leu Gln Ala Leu Glu Gly Leu Lys Met Val Glu Lys Asp Gln Asp Gly 105 110 115 ggc cgc aaa ctg aca cct cag gga caa aga gat ctg gac aga atc gcc 498 Gly Arg Lys Leu Thr Pro Gln Gly Gln Arg Asp Leu Asp Arg Ile Ala 120 125 130 135 gga cag gtg gca gct gcc aac aag aag cat tag aacaaacc atgctgggtt 549 Gly Gln Val Ala Ala Ala Asn Lys Lys His * 140 145 aataaattgc ctcattcgta aaaaaaaaaa a 580 4 2772 DNA Homo sapiens CDS (319)..(2166) 4 cctgcaggta ccggtccgga attcccgggt cgacgatttc gtgtgtgacg ttccaggagc 60 tagtggcctc ttcaccctgg tgacctctgt tccgtattct gtcactgaga gacgccctgg 120 gacatctgtg gtggcttttg tcgcgctggg acctaccctg actacgggag ttgggaggac 180 ccgggacacc gcacagccgg gaaatgattg tgcggcggaa ccagtacagg ttcgcagccc 240 aagtcactgc acccggacgc cgagggctgc agcagaaacg gtttaatagg agaggacacc 300 ctagatccgc ctccgtaa atg gac tca gtg gcc ttt gag gat gtg gct gtg 351 Met Asp Ser Val Ala Phe Glu Asp Val Ala Val 1 5 10 aac ttc acc cat gag gag tgg gct ttg ctg ggt cca tca cag aag aat 399 Asn Phe Thr His Glu Glu Trp Ala Leu Leu Gly Pro Ser Gln Lys Asn 15 20 25 ctc tac aga gat gtg atg cga gaa acc att agg aac ctg aac tgt ata 447 Leu Tyr Arg Asp Val Met Arg Glu Thr Ile Arg Asn Leu Asn Cys Ile 30 35 40 gga atg aaa tgg gaa aac cag aac att gat gat cag cac caa aat ctc 495 Gly Met Lys Trp Glu Asn Gln Asn Ile Asp Asp Gln His Gln Asn Leu 45 50 55 agg aga aat cca agg tgt gat gtg gta gag aga ttt ggt aaa agt aaa 543 Arg Arg Asn Pro Arg Cys Asp Val Val Glu Arg Phe Gly Lys Ser Lys 60 65 70 75 gat ggt agt cag tgt gga gaa acc tta agc cag att cga aat agt att 591 Asp Gly Ser Gln Cys Gly Glu Thr Leu Ser Gln Ile Arg Asn Ser Ile 80 85 90 gta aac aag aac act ccc gcc aga gta gat gca tgt gga agc agt gtg 639 Val Asn Lys Asn Thr Pro Ala Arg Val Asp Ala Cys Gly Ser Ser Val 95 100 105 aat gga gaa gtc ata atg ggt cat tca tcc ctg aat tgc tac atc aga 687 Asn Gly Glu Val Ile Met Gly His Ser Ser Leu Asn Cys Tyr Ile Arg 110 115 120 gtt gat act gga cac aaa cac cgg gag tgt cat gaa tat gca gag aag 735 Val Asp Thr Gly His Lys His Arg Glu Cys His Glu Tyr Ala Glu Lys 125 130 135 tca tat aca cat aag cag tgt ggg aaa ggc tta agt tat cgc cac tcc 783 Ser Tyr Thr His Lys Gln Cys Gly Lys Gly Leu Ser Tyr Arg His Ser 140 145 150 155 ttt caa aca tgt gaa agg cct cac gct gga aag aaa ccc tat gat tgt 831 Phe Gln Thr Cys Glu Arg Pro His Ala Gly Lys Lys Pro Tyr Asp Cys 160 165 170 aag gaa tgt gga aaa acc ttc agt tct cct gga aac ctt cga aga cat 879 Lys Glu Cys Gly Lys Thr Phe Ser Ser Pro Gly Asn Leu Arg Arg His 175 180 185 atg gta gta aaa ggt gga gat gga cct tat aaa tgt gaa ttg tgt ggg 927 Met Val Val Lys Gly Gly Asp Gly Pro Tyr Lys Cys Glu Leu Cys Gly 190 195 200 aaa gcc ttt ttt tgg ccc agt tta tta cgt atg cat gaa aga act cac 975 Lys Ala Phe Phe Trp Pro Ser Leu Leu Arg Met His Glu Arg Thr His 205 210 215 act gga gag aaa cca tat gaa tgt aag cag tgt tct aaa gcc ttc cct 1023 Thr Gly Glu Lys Pro Tyr Glu Cys Lys Gln Cys Ser Lys Ala Phe Pro 220 225 230 235 gtt tac agt tcc tat cta aga cat gaa aaa ata cac act ggg gag aaa 1071 Val Tyr Ser Ser Tyr Leu Arg His Glu Lys Ile His Thr Gly Glu Lys 240 245 250 ccg tat gaa tgt aag cag tgt tct aaa gcc ttc cct gat tac agt tca 1119 Pro Tyr Glu Cys Lys Gln Cys Ser Lys Ala Phe Pro Asp Tyr Ser Ser 255 260 265 tat cta aga cat gaa aga act cac act gga gag aaa ccc tac aaa tgt 1167 Tyr Leu Arg His Glu Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys 270 275 280 aaa caa tgg ggg aaa gcc ttc agt gtt tcc ggt tcc ctt cga gta cat 1215 Lys Gln Trp Gly Lys Ala Phe Ser Val Ser Gly Ser Leu Arg Val His 285 290 295 gaa aga att cac act gga gag aaa ccc tat aca tgt aaa cag tgt ggg 1263 Glu Arg Ile His Thr Gly Glu Lys Pro Tyr Thr Cys Lys Gln Cys Gly 300 305 310 315 aaa gcg ttt tgt cat ctt gga agc ttt caa aga cac atg ata atg cac 1311 Lys Ala Phe Cys His Leu Gly Ser Phe Gln Arg His Met Ile Met His 320 325 330 agt gga gat gga cct cat aaa tgt aag ata tgt ggg aaa ggc ttt gat 1359 Ser Gly Asp Gly Pro His Lys Cys Lys Ile Cys Gly Lys Gly Phe Asp 335 340 345 ttt cct ggt tca gca cga att cat gaa gga act cac act cta gag aaa 1407 Phe Pro Gly Ser Ala Arg Ile His Glu Gly Thr His Thr Leu Glu Lys 350 355 360 ccc tat gaa tgt aag caa tgt ggg aaa ttg tta tct cat cgc tca agc 1455 Pro Tyr Glu Cys Lys Gln Cys Gly Lys Leu Leu Ser His Arg Ser Ser 365 370 375 ttt cga aga cac atg atg gca cac act gga gat ggc cct cat aaa tgc 1503 Phe Arg Arg His Met Met Ala His Thr Gly Asp Gly Pro His Lys Cys 380 385 390 395 aca gta tgt ggg aaa gcc ttt gat tct cct agt gta ttt caa aga cat 1551 Thr Val Cys Gly Lys Ala Phe Asp Ser Pro Ser Val Phe Gln Arg His 400 405 410 gaa agg act cac act gga gag aaa ccc tat gaa tgc aag caa tgt ggg 1599 Glu Arg Thr His Thr Gly Glu Lys Pro Tyr Glu Cys Lys Gln Cys Gly 415 420 425 aaa gcc ttc cgt act tcc agt tcc ctt cga aaa cat gaa aca aca cac 1647 Lys Ala Phe Arg Thr Ser Ser Ser Leu Arg Lys His Glu Thr Thr His 430 435 440 act gga gag caa ccc tat aaa tgt aaa tgt gga aaa gct ttt agt gat 1695 Thr Gly Glu Gln Pro Tyr Lys Cys Lys Cys Gly Lys Ala Phe Ser Asp 445 450 455 tta ttt tcc ttt caa agt cat gaa aca aca cac agt gaa gag gag cct 1743 Leu Phe Ser Phe Gln Ser His Glu Thr Thr His Ser Glu Glu Glu Pro 460 465 470 475 tat gaa tgt aag gag tgt ggg aaa gca ttt agt tct ttt aaa tac ttt 1791 Tyr Glu Cys Lys Glu Cys Gly Lys Ala Phe Ser Ser Phe Lys Tyr Phe 480 485 490 tgt cgc cat gaa agg act cac agt gaa gaa aaa tct tat gag tgt caa 1839 Cys Arg His Glu Arg Thr His Ser Glu Glu Lys Ser Tyr Glu Cys Gln 495 500 505 att tgt ggc aaa gcc ttc agt cgt ttc agt tac tta aaa act cat gaa 1887 Ile Cys Gly Lys Ala Phe Ser Arg Phe Ser Tyr Leu Lys Thr His Glu 510 515 520 agg act cac acg gca gag aag cca tat gaa tgt aag caa tgc agg aaa 1935 Arg Thr His Thr Ala Glu Lys Pro Tyr Glu Cys Lys Gln Cys Arg Lys 525 530 535 gca ttc ttt tgg ccc tct ttc ctt cta aga cat gaa agg act cac act 1983 Ala Phe Phe Trp Pro Ser Phe Leu Leu Arg His Glu Arg Thr His Thr 540 545 550 555 gga gaa aga ccc tat gaa tgt aaa cac tgt ggt aaa gcc ttc agt cgt 2031 Gly Glu Arg Pro Tyr Glu Cys Lys His Cys Gly Lys Ala Phe Ser Arg 560 565 570 tcc agt ttc tgt cga gaa cat gaa aga act cac act gga gag aag ccc 2079 Ser Ser Phe Cys Arg Glu His Glu Arg Thr His Thr Gly Glu Lys Pro 575 580 585 tat gaa tgt aag gaa tgt ggg aaa gcc ttc agt tct ctc agt tcc ttt 2127 Tyr Glu Cys Lys Glu Cys Gly Lys Ala Phe Ser Ser Leu Ser Ser Phe 590 595 600 aat aga cat aaa agg aca cac tgg aag gat att cta taa gtgtatggaa 2176 Asn Arg His Lys Arg Thr His Trp Lys Asp Ile Leu * 605 610 615 tgtgggaaag cattcattgg ttttatcaca ttcagatact tgaaagaaat aaatcctgtg 2236 aatgtaaacg tggtaaagcc ttaagaagtt tccaggctgg gcgcagcggc tcacacctgt 2296 aatcccagca ctttgagagg ccgaggaggg cagatcacga ggccaggaga tcgagaccag 2356 cctggctaac atgggaaacc ctgtctctac taaaaatacg gaaaaaaaaa aaaatagcca 2416 ggcatagttg ctcacacctg tagtcctagc tactcaggag gctgaggcag gagaatccct 2476 tgaacccggg aggtggaggt tgcagtgagc cgagattgca ctactgcact ccagcttggg 2536 tgctagagcg agactccatc tcaaaaaaaa aaaaaaaagt ttccatttct ttcaaataga 2596 gttgctgcct gctatatgca agaagattgg ttccagtaca ccctgagtat acctaaatcc 2656 acagatgcca gctcttttat aaaatggaat attcgcatgt acctacccac attctcctgt 2716 atactctata aatgtctaga ttaattaaaa tatctcatgc attgtaaaaa aaaaaa 2772 5 2078 DNA Homo sapiens CDS (232)..(837) 5 attctataga tcagaggtta catggccaag attgaaactt agaggagtat agttacataa 60 aagaaggcaa aacgatgtat aaatgaaaga aattgagatg gtgcacgatg cacagttgaa 120 gtgaacttgc ggggtttttc agtatctacg attcatagat ctggaattcg cggccgcgtc 180 gacgcggagc agagtcgact gggagcgacc gagcgggccg ccgccgccgc c atg aac 237 Met Asn 1 ccc gaa tat gac tac ctg ttt aag ctg ctt ttg att ggc gac tca ggc 285 Pro Glu Tyr Asp Tyr Leu Phe Lys Leu Leu Leu Ile Gly Asp Ser Gly 5 10 15 gtg ggc aag tca tgc ctg ctc ctg cgg ttt gct gat gac acg tac aca 333 Val Gly Lys Ser Cys Leu Leu Leu Arg Phe Ala Asp Asp Thr Tyr Thr 20 25 30 gag agc tac atc agc acc atc ggg gtg gac ttc aag atc cga acc atc 381 Glu Ser Tyr Ile Ser Thr Ile Gly Val Asp Phe Lys Ile Arg Thr Ile 35 40 45 50 gag ctg gat ggc aaa act atc aaa ctt cag atc tgg gac aca gcg ggc 429 Glu Leu Asp Gly Lys Thr Ile Lys Leu Gln Ile Trp Asp Thr Ala Gly 55 60 65 cag gaa cgg ttc cgg acc atc act tcc agc tac tac cgg ggg gct cat 477 Gln Glu Arg Phe Arg Thr Ile Thr Ser Ser Tyr Tyr Arg Gly Ala His 70 75 80 ggc atc atc gtg gtg tat gac gtc act gac cag gaa tcc tac gcc aac 525 Gly Ile Ile Val Val Tyr Asp Val Thr Asp Gln Glu Ser Tyr Ala Asn 85 90 95 gtg aag cag tgg ctg cag gag att gac cgc tat gcc agc gag aac gtc 573 Val Lys Gln Trp Leu Gln Glu Ile Asp Arg Tyr Ala Ser Glu Asn Val 100 105 110 aat aag ctc ctg gtg ggc aac aag agc gac ctc acc acc aag aag gtg 621 Asn Lys Leu Leu Val Gly Asn Lys Ser Asp Leu Thr Thr Lys Lys Val 115 120 125 130 gtg gac aac acc aca gcc aag gag ttt gca gac tct ctg ggc atc ccc 669 Val Asp Asn Thr Thr Ala Lys Glu Phe Ala Asp Ser Leu Gly Ile Pro 135 140 145 ttc ttg gag acg agc gcc aag aat gcc acc aat gtc gag cag gcg ttc 717 Phe Leu Glu Thr Ser Ala Lys Asn Ala Thr Asn Val Glu Gln Ala Phe 150 155 160 atg acc atg gct gct gaa atc aaa aag cgg atg ggg cct gga gca gcc 765 Met Thr Met Ala Ala Glu Ile Lys Lys Arg Met Gly Pro Gly Ala Ala 165 170 175 tct ggg ggc gag cgg ccc aat ctc aag atc gac agc acc cct gta aag 813 Ser Gly Gly Glu Arg Pro Asn Leu Lys Ile Asp Ser Thr Pro Val Lys 180 185 190 ccg gct ggc ggt ggc tgt tgc tag gaggggcaca tggagtggga caggaggggg 867 Pro Ala Gly Gly Gly Cys Cys * 195 200 caccttctcc agatgatgtc cctggagggg gcaggaggta cctccctctc cctctcctgg 927 ggcatttgag tctgtggctt tggggtgtcc tgggctcccc atctccttct ggcccatctg 987 cctgctgccc tgagccccgg ttctgtcagg gtccctaagg gaggacactc agggcctgtg 1047 gccaggcagg gcggaggcct gctgtgctgt tgcctctagg tgactttcca agatgccccc 1107 ctacacacct ttctttggaa cgagggctct tctgtcggtg tccctcccac ccccatgtat 1167 gctgcactgg gttctctcct tcttcttcct gctgtcctgc ccaagaactg agggtctccc 1227 cggcctctac tgccctggct gcagtcagtg cccagggcga ggaatgtggc caggggatcc 1287 aggacctggg atccagggcc ctgggctgga cctcaggaca ggcatggagg ccacaggggc 1347 ccagcagccc accctttcct ctccccactg cctcctctcc cttcctacac tcccagctcg 1407 agccgtccag ctgcggtggg atctgagtat atctagggcg ggtgggcggg tagcagtgct 1467 gggcctgtgt cttgagcctg gagggagtct gctcctgccg ccctctgccc tgccagagac 1527 agacccatgc gctgcctgcc caccgtgccc ctttgtcccc atgtcaggcg gaggcggaag 1587 gcccaccgtg ccagaggctg ggcaccagcc ttaaccctca ctctgctagc acctcctccc 1647 tttccccaag gtagcacatc tggctcactc cccactccgt ctctggagcc caccagggaa 1707 ggccctcatc ccctgccgct acttctctgg ggaatgtggg ttccatccag gattgggggc 1767 ctctctgctc acccactctg cacccaggat cctagtcccc tgccctctgg cacagctgct 1827 tcctgcaaga aagcaagtct ttggtctccc tgagaagcca tgtccctcgt gctgtctctt 1887 gcctgtccca cctgtgccct gccctccagc ttgtatttaa gtccctgggc tgcccccttg 1947 gggtgccccc cgctcccagg ttcccctctg gtgtcatgtc aggcattttg caaggaaaag 2007 ccacttgggg aaagatggaa aaggacaaaa aaaattaata aatttccatt ggccctcgga 2067 aaaaaaaaaa a 2078 6 1125 DNA Homo sapiens CDS (139)..(1005) 6 ttaacatgca ctaaagggaa taagcttggc accgcggcgc accccgcggc caggaccccg 60 gcctccgcac agaggcggga gacccggagc agcagcagag gcaggaggac taggaggagg 120 aggaggagcc gtcgcagg atg aag gat cgg act caa gag ctg cgg agt gcg 171 Met Lys Asp Arg Thr Gln Glu Leu Arg Ser Ala 1 5 10 aaa gac agt gat gat gaa gag gag gtg gtc cac gtg gat cgg gac cac 219 Lys Asp Ser Asp Asp Glu Glu Glu Val Val His Val Asp Arg Asp His 15 20 25 ttc atg gat gag ttc ttt gaa cag gtg gaa gag atc cgg ggc tgc att 267 Phe Met Asp Glu Phe Phe Glu Gln Val Glu Glu Ile Arg Gly Cys Ile 30 35 40 gag aaa ctg tcg gag gat gtg gag cag gtg aaa aaa cag cat agc gcc 315 Glu Lys Leu Ser Glu Asp Val Glu Gln Val Lys Lys Gln His Ser Ala 45 50 55 atc ctg gcc gca ccc aac cca gat gag aag acc aaa cag gag ctg gag 363 Ile Leu Ala Ala Pro Asn Pro Asp Glu Lys Thr Lys Gln Glu Leu Glu 60 65 70 75 gat ctc act gca gac atc aag aag acg gcc aac aag gtt cgg tcc aaa 411 Asp Leu Thr Ala Asp Ile Lys Lys Thr Ala Asn Lys Val Arg Ser Lys 80 85 90 ttg aaa gcg atc gag caa agc att gaa cag gag gag ggg ctg aac cgt 459 Leu Lys Ala Ile Glu Gln Ser Ile Glu Gln Glu Glu Gly Leu Asn Arg 95 100 105 tcc tcc gcg gac ctg cgc atc cgc aag acc cag cac tcc aca ctg tcc 507 Ser Ser Ala Asp Leu Arg Ile Arg Lys Thr Gln His Ser Thr Leu Ser 110 115 120 cgg aag ttc gtg gag gta atg acc gaa tat aac gcg acc cag tcc aag 555 Arg Lys Phe Val Glu Val Met Thr Glu Tyr Asn Ala Thr Gln Ser Lys 125 130 135 tac cgg gac cgc tgc aag gac cgg atc cag cgg caa ctg gag atc act 603 Tyr Arg Asp Arg Cys Lys Asp Arg Ile Gln Arg Gln Leu Glu Ile Thr 140 145 150 155 gga agg acc acc acc aac gaa gaa ctg gaa gac atg ctg gag agc ggg 651 Gly Arg Thr Thr Thr Asn Glu Glu Leu Glu Asp Met Leu Glu Ser Gly 160 165 170 aag ctg gcc atc ttc aca gat gac atc aaa atg gac tca cag atg acg 699 Lys Leu Ala Ile Phe Thr Asp Asp Ile Lys Met Asp Ser Gln Met Thr 175 180 185 aag cag gcg ctg aat gag att gag acg agg cac aat gag atc atc aag 747 Lys Gln Ala Leu Asn Glu Ile Glu Thr Arg His Asn Glu Ile Ile Lys 190 195 200 ctg gag acc agc atc cgc gag ctg cac gat atg ttt gtg gac atg gcc 795 Leu Glu Thr Ser Ile Arg Glu Leu His Asp Met Phe Val Asp Met Ala 205 210 215 atg ctc gta gag agc cag gga gag atg att gac cgc atc gag tac aac 843 Met Leu Val Glu Ser Gln Gly Glu Met Ile Asp Arg Ile Glu Tyr Asn 220 225 230 235 gtg gaa cat tct gtg gac tac gtg gag cga gct gtg tct gac acc aag 891 Val Glu His Ser Val Asp Tyr Val Glu Arg Ala Val Ser Asp Thr Lys 240 245 250 aaa gca gtg aaa tat cag agc aag gcc cgg agg aag aaa atc atg atc 939 Lys Ala Val Lys Tyr Gln Ser Lys Ala Arg Arg Lys Lys Ile Met Ile 255 260 265 atc att tgc tgt gtg gtg ctg ggg gtg gtc ttg gcg tca tcc att ggg 987 Ile Ile Cys Cys Val Val Leu Gly Val Val Leu Ala Ser Ser Ile Gly 270 275 280 ggg acg ctg ggc ttg tag gccccc cccacccttc tctctcccag acccttcccc 1041 Gly Thr Leu Gly Leu * 285 caccacatcg ggagcaatac ccccaccacc cctttcactc catccccgga cgcgtgggtc 1101 gacccgggaa ttccggaccg gtac 1125 7 2839 DNA Homo sapiens CDS (96)..(2096) 7 aggatcccct gtaacaggct gatgctgagg acctccttga ctccttcctt agcaacattc 60 tacaggactg cagacaccac ctgtgtgaac cggac atg aaa ctg gtg tgg cct 113 Met Lys Leu Val Trp Pro 1 5 agt gcc aag ctg ttg cag gca gct gca ggt gca tct gcc cgg gcc tgt 161 Ser Ala Lys Leu Leu Gln Ala Ala Ala Gly Ala Ser Ala Arg Ala Cys 10 15 20 gac tct gac acc agc aat gta ccg cct tta ctg ctg gaa cag ttc cgc 209 Asp Ser Asp Thr Ser Asn Val Pro Pro Leu Leu Leu Glu Gln Phe Arg 25 30 35 aag cac agt cag agc agc cag cgg cgg aca atc ctt gaa atg ctc ctg 257 Lys His Ser Gln Ser Ser Gln Arg Arg Thr Ile Leu Glu Met Leu Leu 40 45 50 ggt ttc ttg aag ctg cag cag aaa tgg agc tat gaa gac aaa gat caa 305 Gly Phe Leu Lys Leu Gln Gln Lys Trp Ser Tyr Glu Asp Lys Asp Gln 55 60 65 70 agg cct ctg aat ggc ttc aag gac cag ctg tgc tca ctg gta ttc atg 353 Arg Pro Leu Asn Gly Phe Lys Asp Gln Leu Cys Ser Leu Val Phe Met 75 80 85 gct cta aca gac ccc agc acc cag ctt cag ctt gtt ggc atc cgt aca 401 Ala Leu Thr Asp Pro Ser Thr Gln Leu Gln Leu Val Gly Ile Arg Thr 90 95 100 ctg aca gtc ttg ggt gcc cag cca gat ctc cta tct tat gag gac ttg 449 Leu Thr Val Leu Gly Ala Gln Pro Asp Leu Leu Ser Tyr Glu Asp Leu 105 110 115 gag ctg gca gtg ggt cac ctg tac aga ctg agc ttc ctg aag gag gat 497 Glu Leu Ala Val Gly His Leu Tyr Arg Leu Ser Phe Leu Lys Glu Asp 120 125 130 tcc cag agt tgc agg gtg gca gca ctg gaa gca tca gga acc ctg gct 545 Ser Gln Ser Cys Arg Val Ala Ala Leu Glu Ala Ser Gly Thr Leu Ala 135 140 145 150 gct ctc tac cct gtg gcc ttc agc agc cac ctc gta ccc aag ctc gct 593 Ala Leu Tyr Pro Val Ala Phe Ser Ser His Leu Val Pro Lys Leu Ala 155 160 165 gag gag ctg cgt gta ggg gag tca aat ttg act aac gga gat gag ccc 641 Glu Glu Leu Arg Val Gly Glu Ser Asn Leu Thr Asn Gly Asp Glu Pro 170 175 180 acc caa tgc tcc cgg cat ctg tgc tgt ctg caa gcc ttg tca gct gta 689 Thr Gln Cys Ser Arg His Leu Cys Cys Leu Gln Ala Leu Ser Ala Val 185 190 195 tca aca cat ccc agc atc gtc aag gag aca ctg cct ctg ctg ctg cag 737 Ser Thr His Pro Ser Ile Val Lys Glu Thr Leu Pro Leu Leu Leu Gln 200 205 210 cat ctc tgg caa gtg aac aga ggg aat atg gtt gca caa tcc agt gac 785 His Leu Trp Gln Val Asn Arg Gly Asn Met Val Ala Gln Ser Ser Asp 215 220 225 230 gtt att gct gtc tgt cag agc ctc aga cag atg gca gaa aaa tgt cag 833 Val Ile Ala Val Cys Gln Ser Leu Arg Gln Met Ala Glu Lys Cys Gln 235 240 245 cag gac cct gag agt tgc tgg tat ttc cac cag aca gct ata cct tgc 881 Gln Asp Pro Glu Ser Cys Trp Tyr Phe His Gln Thr Ala Ile Pro Cys 250 255 260 ctg ctt gcc ttg gct gtg cag gcc tct atg cca gag aag gag ccc tca 929 Leu Leu Ala Leu Ala Val Gln Ala Ser Met Pro Glu Lys Glu Pro Ser 265 270 275 gtt ctg aga aaa gta cta ttg gag gat gag gtg ttg gct gcc atg gtg 977 Val Leu Arg Lys Val Leu Leu Glu Asp Glu Val Leu Ala Ala Met Val 280 285 290 tct gtc att ggc act gct aca acc cac ctg agc cct gag tta gct gcc 1025 Ser Val Ile Gly Thr Ala Thr Thr His Leu Ser Pro Glu Leu Ala Ala 295 300 305 310 cag agt gtg aca cac att gtg ccc ctc ttc ttg gat ggc aac gtg tcc 1073 Gln Ser Val Thr His Ile Val Pro Leu Phe Leu Asp Gly Asn Val Ser 315 320 325 ttt ctg cct gaa aac agc ttc ccg agc aga ttc cag cca ttc cag gat 1121 Phe Leu Pro Glu Asn Ser Phe Pro Ser Arg Phe Gln Pro Phe Gln Asp 330 335 340 ggc tcc tca ggg cag agg cgg ctg att gca ctg ctt atg gcc ttt gtc 1169 Gly Ser Ser Gly Gln Arg Arg Leu Ile Ala Leu Leu Met Ala Phe Val 345 350 355 tgc tcc ctg cct cga aat gtg gaa atc cct cag ctg aac caa ctc atg 1217 Cys Ser Leu Pro Arg Asn Val Glu Ile Pro Gln Leu Asn Gln Leu Met 360 365 370 cgg gag ctt ttg gaa ctg agc tgc tgc cac agc tgc ccc ttt tct tcc 1265 Arg Glu Leu Leu Glu Leu Ser Cys Cys His Ser Cys Pro Phe Ser Ser 375 380 385 390 acc gct gct gcc aag tgc ttt gca gga ctc ctc aac aag cac cct gca 1313 Thr Ala Ala Ala Lys Cys Phe Ala Gly Leu Leu Asn Lys His Pro Ala 395 400 405 ggg cag cag ctg gat gaa ttc cta cag cta gct gtg gac aaa gtg gag 1361 Gly Gln Gln Leu Asp Glu Phe Leu Gln Leu Ala Val Asp Lys Val Glu 410 415 420 gct ggc ctg gac tct ggg ccc tgt cgt agt cag gcc ttc act ctt ctt 1409 Ala Gly Leu Asp Ser Gly Pro Cys Arg Ser Gln Ala Phe Thr Leu Leu 425 430 435 ctc tgg gta aca aag gcc cta gtg ctc aga tac cat cct ctc agc tcc 1457 Leu Trp Val Thr Lys Ala Leu Val Leu Arg Tyr His Pro Leu Ser Ser 440 445 450 tgc ctt aca gcc cgg ctc atg ggc ctc ctg agt gac cca gaa tta ggt 1505 Cys Leu Thr Ala Arg Leu Met Gly Leu Leu Ser Asp Pro Glu Leu Gly 455 460 465 470 cca gca gca gct gat ggc ttc tct ctg ctc atg tct gac tgc act gat 1553 Pro Ala Ala Ala Asp Gly Phe Ser Leu Leu Met Ser Asp Cys Thr Asp 475 480 485 gtg ctg act cgt gct ggc cat gcc gaa gtg cgg atc atg ttc cgc cag 1601 Val Leu Thr Arg Ala Gly His Ala Glu Val Arg Ile Met Phe Arg Gln 490 495 500 cgg ttc ttc aca gat aat gtg cct gct ttg gtc cag ggc ttc cat gct 1649 Arg Phe Phe Thr Asp Asn Val Pro Ala Leu Val Gln Gly Phe His Ala 505 510 515 gct ccc caa gat gtg aag cca aac tac ttg aag ggt ctt tct cat gta 1697 Ala Pro Gln Asp Val Lys Pro Asn Tyr Leu Lys Gly Leu Ser His Val 520 525 530 ctt aac agg ctg ccc aag cct gta ctc ttg cca gag ctg ccc acg ctt 1745 Leu Asn Arg Leu Pro Lys Pro Val Leu Leu Pro Glu Leu Pro Thr Leu 535 540 545 550 ctt tcc ttg ctg ctg gag gcc ctg tcc tgc cct gac tgt gtg gtg cag 1793 Leu Ser Leu Leu Leu Glu Ala Leu Ser Cys Pro Asp Cys Val Val Gln 555 560 565 ctc tcc acc ctc agc tgc ctt cag cct ctt cta ctg gaa gca ccc caa 1841 Leu Ser Thr Leu Ser Cys Leu Gln Pro Leu Leu Leu Glu Ala Pro Gln 570 575 580 gtc atg agt ctt cac gtg gac acc ctc gtc acc aag ttt ctg aac ctc 1889 Val Met Ser Leu His Val Asp Thr Leu Val Thr Lys Phe Leu Asn Leu 585 590 595 agc tct agc cct tcc atg gct gtc cgg atc gcc gca ctg cag tgc atg 1937 Ser Ser Ser Pro Ser Met Ala Val Arg Ile Ala Ala Leu Gln Cys Met 600 605 610 cat gct ctc act cgc ctg ccc acc cct gtg ctg ctg ccg tac aaa cca 1985 His Ala Leu Thr Arg Leu Pro Thr Pro Val Leu Leu Pro Tyr Lys Pro 615 620 625 630 cag gtg att cgg gcc tta gcc aaa ccc ctg gat gac aag aag aga ctg 2033 Gln Val Ile Arg Ala Leu Ala Lys Pro Leu Asp Asp Lys Lys Arg Leu 635 640 645 gtg cgc aag gaa gca gtg tca gcc aga ggg gag tgg ttt ctg ttg ggg 2081 Val Arg Lys Glu Ala Val Ser Ala Arg Gly Glu Trp Phe Leu Leu Gly 650 655 660 agc cct ggc agc tga gccctcagtc ctggcctaga ctgttctgac aatctaacct 2136 Ser Pro Gly Ser * 665 gggattacta actgttgagc catcttcccc aaagcaggga aaccactggt ctctgactgc 2196 ctttcccaca gacacagcac aaatgctagg cctctgttgc atggctgtac aaagaacata 2256 agagtccata tttctagtgg atttgtaaaa taagtgtgtg tgagacactt gcgtttgaag 2316 aaagatctag ggtcctgggt ctcttgcatt tatatgtcag aaaaggggcg atatgctgct 2376 gaggggtgag tgcatatgag tgtggccctg aggaccaggg ctggcagatg ttgtctacct 2436 gctgaagaat aaagatttct tttggtaatg gggctgtcag atatttcccc caccccacat 2496 atgcctcata taaccagagg catcactgga ataacaatga cagggattgg ccaaaaattt 2556 ttcagggtaa ccaggacagt agcacatgcc tgtaacccca acacttcagg agtctgaggc 2616 aggcggatca cttgaggcta ggagtttgag actagccctg gcagtatagc aagaacccat 2676 ctctacaaat aattaaaaaa aaaaaaaaaa gggcggccgc tctagagtat ccctcgaggg 2736 gcccaagctt acgcgtaccc agctttcttg tacaaagtgg tccctatagt gagtcgtaaa 2796 taagcaggga ctgtcaagga ctctatgcct gggggcgcct ctg 2839 8 1837 DNA Homo sapiens CDS (258)..(1304) 8 tgggacgtga ttagaattag gagaattcga atgatgcata taactatcta ttcgatgatg 60 aagatacccc accaaaccca aaaaaagaga tctctcgagg atccgaattc gcggccgcgt 120 cgacgcgacc gttccggcgg ccattgcgaa aacttcccca cggctactgc gtccacgtgg 180 cggtggcgtg gggactccct gaaagcagag cggcagggcg cccggaagtc gtgagtcgag 240 tcttcccggg ctaatcc atg ccg ggt tgg agg ctg ctg acg cag gtc ggc 290 Met Pro Gly Trp Arg Leu Leu Thr Gln Val Gly 1 5 10 gcc cag gtg ctg ggt cga ctc ggg gac ggc ctg ggt gct gcc ctg ggc 338 Ala Gln Val Leu Gly Arg Leu Gly Asp Gly Leu Gly Ala Ala Leu Gly 15 20 25 ccg ggg aac aga aca cac atc tgg ctt ttt gtt aga ggt ctt cat gga 386 Pro Gly Asn Arg Thr His Ile Trp Leu Phe Val Arg Gly Leu His Gly 30 35 40 aag agt ggt aca tgg tgg gat gag cat ctt tct gaa gaa aat gtc cca 434 Lys Ser Gly Thr Trp Trp Asp Glu His Leu Ser Glu Glu Asn Val Pro 45 50 55 ttc att aag cag ttg gtc tct gat gaa gat aaa gcc caa tta gca agt 482 Phe Ile Lys Gln Leu Val Ser Asp Glu Asp Lys Ala Gln Leu Ala Ser 60 65 70 75 aaa ctg tgt cct ctg aaa gat gaa cca tgg cct ata cat cct tgg gaa 530 Lys Leu Cys Pro Leu Lys Asp Glu Pro Trp Pro Ile His Pro Trp Glu 80 85 90 cca ggt tcc ttt aga gtt ggt ctt att gcc ttg aag ctg ggc atg atg 578 Pro Gly Ser Phe Arg Val Gly Leu Ile Ala Leu Lys Leu Gly Met Met 95 100 105 cct tta tgg acc aag gat ggt caa aag cat gtg gtc aca tta ctt cag 626 Pro Leu Trp Thr Lys Asp Gly Gln Lys His Val Val Thr Leu Leu Gln 110 115 120 gta caa gac tgt cat gtc tta aaa tat acg tca aag gaa aac tgt aat 674 Val Gln Asp Cys His Val Leu Lys Tyr Thr Ser Lys Glu Asn Cys Asn 125 130 135 gga aaa atg gca acc ctg tct gta gga gga aaa act gta tca cgt ttt 722 Gly Lys Met Ala Thr Leu Ser Val Gly Gly Lys Thr Val Ser Arg Phe 140 145 150 155 cgt aaa gct aca tcc ata ttg gaa ttt tac cgg gaa ctt gga ttg ccg 770 Arg Lys Ala Thr Ser Ile Leu Glu Phe Tyr Arg Glu Leu Gly Leu Pro 160 165 170 ccg aaa cag aca gtt aaa atc ttt aat ata aca gat aat gct gca att 818 Pro Lys Gln Thr Val Lys Ile Phe Asn Ile Thr Asp Asn Ala Ala Ile 175 180 185 aaa cca ggc act cct ctt tat gct gct cac ttt cgt cca gga cag tat 866 Lys Pro Gly Thr Pro Leu Tyr Ala Ala His Phe Arg Pro Gly Gln Tyr 190 195 200 gtg gat gtc aca gcc aaa act att ggt aaa ggt ttt caa ggt gtc atg 914 Val Asp Val Thr Ala Lys Thr Ile Gly Lys Gly Phe Gln Gly Val Met 205 210 215 aaa aga tgg gga ttt aaa ggc cag cct gct acg cat ggt caa acg aaa 962 Lys Arg Trp Gly Phe Lys Gly Gln Pro Ala Thr His Gly Gln Thr Lys 220 225 230 235 acc cac agg aga cct gga gct gtt gca act ggt gat att ggc aga gtc 1010 Thr His Arg Arg Pro Gly Ala Val Ala Thr Gly Asp Ile Gly Arg Val 240 245 250 tgg cct gga act aaa atg cct gga aaa atg gga aac ata tac agg aca 1058 Trp Pro Gly Thr Lys Met Pro Gly Lys Met Gly Asn Ile Tyr Arg Thr 255 260 265 gaa tat gga ctg aaa gtg tgg aga ata aac aca aag cac aac ata atc 1106 Glu Tyr Gly Leu Lys Val Trp Arg Ile Asn Thr Lys His Asn Ile Ile 270 275 280 tat gta aat ggc tct gta cct gga cat aaa aat tgc tta gta aag gtc 1154 Tyr Val Asn Gly Ser Val Pro Gly His Lys Asn Cys Leu Val Lys Val 285 290 295 aaa gat tct aaa ctg cct gca tat aag gat ctc ggt aaa aat cta cca 1202 Lys Asp Ser Lys Leu Pro Ala Tyr Lys Asp Leu Gly Lys Asn Leu Pro 300 305 310 315 ttc cct aca tat ttt cct gat gga gat gaa gag gaa ctg cca gaa gat 1250 Phe Pro Thr Tyr Phe Pro Asp Gly Asp Glu Glu Glu Leu Pro Glu Asp 320 325 330 ttg tat gat gaa aac gtg tgt cag ccc ggt gcg cct tct att aca ttt 1298 Leu Tyr Asp Glu Asn Val Cys Gln Pro Gly Ala Pro Ser Ile Thr Phe 335 340 345 gcc taa catctttgga cgtggcagaa ccttacatat tctgtgagct tcgatgagcc 1354 Ala * agagtgatat cataaccacc agaaatcata ctctcctttc ttagtcacaa caaaatcaca 1414 catgtcatct ttgtcaaggg cataaatata tcattcatac ccccattaaa ttttgttaga 1474 aaaattacca cattaaatat atgagttaag tagattggat ttgctgaaat tggtgttggg 1534 catattagca aaatattctt aatttgtgga ctcgattctt ttttactaca tatttcccaa 1594 gttatcttaa gatgtctgta aatttaactt ttattaaagt tttgtcaatc tttgtgaaat 1654 agtggttgtg gaacagtaga aaaccatatg gggactatag tgcaacctat ttgggtaaag 1714 aaaccatttg ctaaaatgga gaaagtaaat agatttttat ttaaattaca gaaacatgtt 1774 aaaggccgga caaaggaaag acaataaaat cataaattat cggtcctgtt taaaaaaaaa 1834 aaa 1837 9 1913 DNA Homo sapiens CDS (541)..(954) 9 taagcttgcg gccgcaattt tttttttttt tttttccctc aaacaagaag ggcttccttt 60 aatagacttt aatatagctc atttacaatc gtatgaaaat ccatcaagat caatcatgtt 120 gcacaatccc taaaggataa aactgaggga agtcatggat aattttcaca gcttcattgt 180 aaagtccaag atcaaaagga tggctatctt ctttaaactg gacataagat gtacacaaga 240 tggtggcctt ggcggttact cttccaacca cttccacaat tccagagatt tcttcatcaa 300 ggggttccat caactcgatg gttccatttt ttccttctcc atctgaaaga ataaacattt 360 ttccggtggg atgaatcttt ctgcgatcac aggattcccg gcggtgactt gaccccggaa 420 gtggggtgtg aagctccggt gctggtgcgg cgggggactg cggggccagc ctcagtacca 480 gcagcagcag cagcagcagc agcagcagca gcagcagcag cagcagcagc agcagcagca 540 atg ttt cac ttc ttc aga aag cct ccg gaa tct aaa aag ccc tca gta 588 Met Phe His Phe Phe Arg Lys Pro Pro Glu Ser Lys Lys Pro Ser Val 1 5 10 15 cca gag aca gaa gca gat gga ttc gtc ctt tta gga gat aca aca gat 636 Pro Glu Thr Glu Ala Asp Gly Phe Val Leu Leu Gly Asp Thr Thr Asp 20 25 30 gag caa aga atg aca gca aga ggc aaa act tcg gac ata gag gcc aac 684 Glu Gln Arg Met Thr Ala Arg Gly Lys Thr Ser Asp Ile Glu Ala Asn 35 40 45 caa cct ttg gag acc aac aaa gaa aat tca tcc agt gtg act gta tca 732 Gln Pro Leu Glu Thr Asn Lys Glu Asn Ser Ser Ser Val Thr Val Ser 50 55 60 gac cct gag atg gaa aat aag gca ggc cag act ctg gag aac agc tca 780 Asp Pro Glu Met Glu Asn Lys Ala Gly Gln Thr Leu Glu Asn Ser Ser 65 70 75 80 tta atg gcc gag ctc ctg agc gat gtg ccg ttc acc ctg gcc ccg cat 828 Leu Met Ala Glu Leu Leu Ser Asp Val Pro Phe Thr Leu Ala Pro His 85 90 95 gtg ctg gca gta cag ggc acc atc act gac ctt ccc gac cac tta ctc 876 Val Leu Ala Val Gln Gly Thr Ile Thr Asp Leu Pro Asp His Leu Leu 100 105 110 tcc tat gat ggc agc gaa aac tta tca cgg ttt tgg tat gat ttc act 924 Ser Tyr Asp Gly Ser Glu Asn Leu Ser Arg Phe Trp Tyr Asp Phe Thr 115 120 125 ctt gaa aat tca gtg ctc tgt gat tca taa c ctttatgtct gtttgcacct 975 Leu Glu Asn Ser Val Leu Cys Asp Ser * 130 135 taacagcttt aaaatatgtt cgcctatttt atctaacctg tttgatgttc tttgccgttt 1035 cactgtttaa ggtcctcagc aaggatcata aagcaaagaa aatagcatta tgttcactac 1095 tctattttta agaaaaaggt acatttgtat acaaattgaa cttaagttcc acttcctttc 1155 tcccatataa taaatataca aattaggcta tgaaggtttt aggaaaggac tcgattcctt 1215 cagatggtct ctcaaaatat aacacctcaa atttatctta gaagaactgt gaaaaagaat 1275 tgtggcattt ttcagtcact acagctccga agtctgagca agaagtgggt gtgaagtctc 1335 ctctctggtt tgtaggaagt tgaattagtg ttattcctgc atttttttct tccacagtgt 1395 ttatgaatga attcagaaaa aaagttgcca gttagcttaa tttcttcaga tgcattgatg 1455 tggaaattta aaacttgtcc taaacagcag tgctagtttg ctgatacaag gatgctagac 1515 ctgctctgcg tgctctttct ggagtggccc atttcggttt tctgaaaccc atggcagccc 1575 tttccatcgt gaataatcgt tgtgttccca cctttgttct ctgccttttt gctacttcag 1635 tgtgctctct gaccagcttt ccaaagaact ttccctgctt ctgcctcggt tgccatttgc 1695 tctccttacc acatatgttc ccagtttatg aagagatcca catttccttt caacccctct 1755 gcctcctgaa gaaaaacatc tcatgatgat acatattatt gctgataaca cccttattta 1815 gaaatttgtt ggccacaata cagatggaaa tgttttactg ctggaaaact gaaatcaact 1875 catttccaat tagagtatag gcagaacacc tagatatg 1913 10 3840 DNA Homo sapiens CDS (81)..(3572) 10 cggtgaaggt gcaggcccgg ccccgccact gccgcagcca ggagatggtt cgggcctagc 60 ggagccggga ctggagcaac atg aac cgt gaa gac cgg aat gtg ctg cgt 110 Met Asn Arg Glu Asp Arg Asn Val Leu Arg 1 5 10 atg aaa gaa cgg gaa agg cgg aat cag gaa att cag cag ggc gaa gac 158 Met Lys Glu Arg Glu Arg Arg Asn Gln Glu Ile Gln Gln Gly Glu Asp 15 20 25 gcc ttc cca cct agc tct cct ctc ttt gca gag cca tac aaa gtt act 206 Ala Phe Pro Pro Ser Ser Pro Leu Phe Ala Glu Pro Tyr Lys Val Thr 30 35 40 agc aaa gaa gat aag tta tca agt cgt att cag agt atg ctt gga aac 254 Ser Lys Glu Asp Lys Leu Ser Ser Arg Ile Gln Ser Met Leu Gly Asn 45 50 55 tac gat gtc atg aaa gat ttc ata gga gac aga tct ata cca aag ctt 302 Tyr Asp Val Met Lys Asp Phe Ile Gly Asp Arg Ser Ile Pro Lys Leu 60 65 70 gtt gca att ccc aag cct aca gta cca cca tca gca gat gaa aaa tct 350 Val Ala Ile Pro Lys Pro Thr Val Pro Pro Ser Ala Asp Glu Lys Ser 75 80 85 90 aac cca aat ttc ttt gaa cag aga cat gga ggc tct cat cag agt agc 398 Asn Pro Asn Phe Phe Glu Gln Arg His Gly Gly Ser His Gln Ser Ser 95 100 105 aaa tgg act cca gta gga ccc gca ccc agc act tct cag tct cag aaa 446 Lys Trp Thr Pro Val Gly Pro Ala Pro Ser Thr Ser Gln Ser Gln Lys 110 115 120 cgg tcc tca ggc tta cag agt gga cat agt agc cag cgg acc agc gca 494 Arg Ser Ser Gly Leu Gln Ser Gly His Ser Ser Gln Arg Thr Ser Ala 125 130 135 ggt agc agt agt ggc act aac agt agt ggt cag agg cac gac cgt gag 542 Gly Ser Ser Ser Gly Thr Asn Ser Ser Gly Gln Arg His Asp Arg Glu 140 145 150 tca tat aac aat agt ggg agc agt agc cgg aaa aaa ggc cag cat gga 590 Ser Tyr Asn Asn Ser Gly Ser Ser Ser Arg Lys Lys Gly Gln His Gly 155 160 165 170 tca gaa cac tcc aaa tca cgt tct tcc agc cct gga aaa ccc cag gct 638 Ser Glu His Ser Lys Ser Arg Ser Ser Ser Pro Gly Lys Pro Gln Ala 175 180 185 gtt tct tca tta aac tct agt cat tcc agg tct cat ggg aat gat cac 686 Val Ser Ser Leu Asn Ser Ser His Ser Arg Ser His Gly Asn Asp His 190 195 200 cat agc aag gaa cat caa cgc tcc aaa tca cct cgg gac cct gat gca 734 His Ser Lys Glu His Gln Arg Ser Lys Ser Pro Arg Asp Pro Asp Ala 205 210 215 aac tgg gat tct cct tcc cgt gta cct ttt tca agt ggg cag cac tca 782 Asn Trp Asp Ser Pro Ser Arg Val Pro Phe Ser Ser Gly Gln His Ser 220 225 230 act caa tct ttc cca ccc tca ttg atg tca aag tcc aat tca atg tta 830 Thr Gln Ser Phe Pro Pro Ser Leu Met Ser Lys Ser Asn Ser Met Leu 235 240 245 250 cag aaa ccc act gcc tat gtg cgg ccc atg gac gga cag gag tcc atg 878 Gln Lys Pro Thr Ala Tyr Val Arg Pro Met Asp Gly Gln Glu Ser Met 255 260 265 gaa cca aag ctg tcc tct gag cac tac agc agc caa tcc cat ggc aac 926 Glu Pro Lys Leu Ser Ser Glu His Tyr Ser Ser Gln Ser His Gly Asn 270 275 280 agc atg act gag ctg aag ccc agc agc aaa gca cat ctc acc aag ctg 974 Ser Met Thr Glu Leu Lys Pro Ser Ser Lys Ala His Leu Thr Lys Leu 285 290 295 aaa ata cct tcc caa cca ctg gat gca tca gct tct ggt gat gtg agc 1022 Lys Ile Pro Ser Gln Pro Leu Asp Ala Ser Ala Ser Gly Asp Val Ser 300 305 310 tgt gtg gat gaa atc cta aaa gag atg acg cat tca tgg cct ccc cct 1070 Cys Val Asp Glu Ile Leu Lys Glu Met Thr His Ser Trp Pro Pro Pro 315 320 325 330 cta acg gct att cat aca cca tgc aaa aca gaa cct tcc aaa ttt cct 1118 Leu Thr Ala Ile His Thr Pro Cys Lys Thr Glu Pro Ser Lys Phe Pro 335 340 345 ttt cca act aag gag tct cag cag tcc aat ttt ggc act gga gaa caa 1166 Phe Pro Thr Lys Glu Ser Gln Gln Ser Asn Phe Gly Thr Gly Glu Gln 350 355 360 aaa aga tat aat cct tct aaa act tca aat ggg cac cag tct aaa tct 1214 Lys Arg Tyr Asn Pro Ser Lys Thr Ser Asn Gly His Gln Ser Lys Ser 365 370 375 atg tta aaa gat gac tta aaa cta agc agc agt gaa gac agt gat ggg 1262 Met Leu Lys Asp Asp Leu Lys Leu Ser Ser Ser Glu Asp Ser Asp Gly 380 385 390 gaa cag gat tgt gat aag aca atg ccg agg agt aca cca gga agt aac 1310 Glu Gln Asp Cys Asp Lys Thr Met Pro Arg Ser Thr Pro Gly Ser Asn 395 400 405 410 tct gaa cct tca cac cat aat agt gaa gga gca gat aac tcc agg gat 1358 Ser Glu Pro Ser His His Asn Ser Glu Gly Ala Asp Asn Ser Arg Asp 415 420 425 gat tct agt agc cac agt gga tct gaa agc agc tct gga tct gac tca 1406 Asp Ser Ser Ser His Ser Gly Ser Glu Ser Ser Ser Gly Ser Asp Ser 430 435 440 gag agt gaa agt agt tcc agt gac agt gag gca aat gag cca tcc cag 1454 Glu Ser Glu Ser Ser Ser Ser Asp Ser Glu Ala Asn Glu Pro Ser Gln 445 450 455 agt gca tct ccc gag cct gaa ccc ccg cca aca aac aaa tgg caa ctt 1502 Ser Ala Ser Pro Glu Pro Glu Pro Pro Pro Thr Asn Lys Trp Gln Leu 460 465 470 gat aat tgg ctg aat aaa gtg aac cca cat aaa gtg tca ccc gcc tct 1550 Asp Asn Trp Leu Asn Lys Val Asn Pro His Lys Val Ser Pro Ala Ser 475 480 485 490 tca gtg gac agt aac atc cca tca tct caa ggc tac aaa aag gaa ggc 1598 Ser Val Asp Ser Asn Ile Pro Ser Ser Gln Gly Tyr Lys Lys Glu Gly 495 500 505 cga gag cag ggc act ggg aat agc tac act gat aca agt gga cct aaa 1646 Arg Glu Gln Gly Thr Gly Asn Ser Tyr Thr Asp Thr Ser Gly Pro Lys 510 515 520 gaa acg agt tcc gct act ccg gga cga gac tcc aaa acc atc caa aag 1694 Glu Thr Ser Ser Ala Thr Pro Gly Arg Asp Ser Lys Thr Ile Gln Lys 525 530 535 gga tca gaa agt ggg cgt ggg agg cag aaa tct cct gca cag agt gac 1742 Gly Ser Glu Ser Gly Arg Gly Arg Gln Lys Ser Pro Ala Gln Ser Asp 540 545 550 agc aca aca cag aga aga act gta ggc aaa aaa caa ccc aaa aag gct 1790 Ser Thr Thr Gln Arg Arg Thr Val Gly Lys Lys Gln Pro Lys Lys Ala 555 560 565 570 gag aag gca gct gct gaa gag cct cgt gga ggc ctg aag ata gaa agt 1838 Glu Lys Ala Ala Ala Glu Glu Pro Arg Gly Gly Leu Lys Ile Glu Ser 575 580 585 gaa acc cct gta gac ttg gct agc agc atg ccc tcc agc aga cac aaa 1886 Glu Thr Pro Val Asp Leu Ala Ser Ser Met Pro Ser Ser Arg His Lys 590 595 600 gca gcc acc aaa ggc tca agg aaa ccc aat ata aag aag gag tct aag 1934 Ala Ala Thr Lys Gly Ser Arg Lys Pro Asn Ile Lys Lys Glu Ser Lys 605 610 615 tct tcc cct cga cct aca gca gag aaa aag aaa tat aag tca aca agt 1982 Ser Ser Pro Arg Pro Thr Ala Glu Lys Lys Lys Tyr Lys Ser Thr Ser 620 625 630 aaa tct tcc cag aaa tca agg gaa atc ata gaa aca gat acc tca tcc 2030 Lys Ser Ser Gln Lys Ser Arg Glu Ile Ile Glu Thr Asp Thr Ser Ser 635 640 645 650 tca gat tca gat gaa agt gag agc ctt cct cct tcc tca caa act cct 2078 Ser Asp Ser Asp Glu Ser Glu Ser Leu Pro Pro Ser Ser Gln Thr Pro 655 660 665 aag tac ccc gag agc aat agg act cct gtt aaa ccc tcc tca gtg gag 2126 Lys Tyr Pro Glu Ser Asn Arg Thr Pro Val Lys Pro Ser Ser Val Glu 670 675 680 gaa gaa gat agc ttt ttt cgg caa cga atg ttc tct cct atg gaa gag 2174 Glu Glu Asp Ser Phe Phe Arg Gln Arg Met Phe Ser Pro Met Glu Glu 685 690 695 aag gaa ctt ctt tca ccc ctc agt gag cct gat gac agg tac cca ctt 2222 Lys Glu Leu Leu Ser Pro Leu Ser Glu Pro Asp Asp Arg Tyr Pro Leu 700 705 710 att gtg aag att gac ctg aat ctt ttg act aga ata cca gga aag cct 2270 Ile Val Lys Ile Asp Leu Asn Leu Leu Thr Arg Ile Pro Gly Lys Pro 715 720 725 730 tac aaa gaa aca gag ccg ccc aag ggg gaa aag aaa aat gtg cca gaa 2318 Tyr Lys Glu Thr Glu Pro Pro Lys Gly Glu Lys Lys Asn Val Pro Glu 735 740 745 aag cac acg aga gag gct cag aaa caa gcc tca gaa aaa gtt tcc aac 2366 Lys His Thr Arg Glu Ala Gln Lys Gln Ala Ser Glu Lys Val Ser Asn 750 755 760 aaa ggc aag agg aag cat aag aat gaa gat gat aac cga gcc agt gag 2414 Lys Gly Lys Arg Lys His Lys Asn Glu Asp Asp Asn Arg Ala Ser Glu 765 770 775 agc aag aaa ccc aaa acg gag gac aag aat tca gca ggc cat aag cca 2462 Ser Lys Lys Pro Lys Thr Glu Asp Lys Asn Ser Ala Gly His Lys Pro 780 785 790 tcc agc aac aga gag tca tct aag cag agt gct gca aaa gaa aag gat 2510 Ser Ser Asn Arg Glu Ser Ser Lys Gln Ser Ala Ala Lys Glu Lys Asp 795 800 805 810 ttg ttg cct tct ccc gct ggg cct gtt cct tca aaa gat cca aaa aca 2558 Leu Leu Pro Ser Pro Ala Gly Pro Val Pro Ser Lys Asp Pro Lys Thr 815 820 825 gag cat ggc tct cgg aag agg act att agt cag tct tct tcc tta aag 2606 Glu His Gly Ser Arg Lys Arg Thr Ile Ser Gln Ser Ser Ser Leu Lys 830 835 840 tca agc agt aac agc aac aag gag acg agt ggc agc agc aaa aac agt 2654 Ser Ser Ser Asn Ser Asn Lys Glu Thr Ser Gly Ser Ser Lys Asn Ser 845 850 855 tcc tcc aca tca aag cag aag aag acc gaa ggg aag act tcc agt agc 2702 Ser Ser Thr Ser Lys Gln Lys Lys Thr Glu Gly Lys Thr Ser Ser Ser 860 865 870 tcc aag gag gtt aag gaa aag gct cca agt agc tcc tct aac tgt cct 2750 Ser Lys Glu Val Lys Glu Lys Ala Pro Ser Ser Ser Ser Asn Cys Pro 875 880 885 890 cca tct gca cca act ctt gat tct tct aag cct cgg aga aca aag ctt 2798 Pro Ser Ala Pro Thr Leu Asp Ser Ser Lys Pro Arg Arg Thr Lys Leu 895 900 905 gtc ttt gat gac aga aat tat tca gca gac cat tat tta caa gaa gca 2846 Val Phe Asp Asp Arg Asn Tyr Ser Ala Asp His Tyr Leu Gln Glu Ala 910 915 920 aaa aag cta aag cac aat gca gat gca ttg tct gat agg ttt gag aaa 2894 Lys Lys Leu Lys His Asn Ala Asp Ala Leu Ser Asp Arg Phe Glu Lys 925 930 935 gct gta tac tat ctt gat gct gtg gta tct ttc att gaa tgt ggg aat 2942 Ala Val Tyr Tyr Leu Asp Ala Val Val Ser Phe Ile Glu Cys Gly Asn 940 945 950 gca tta gag aag aat gct cag gaa tcc aaa tcc cca ttc cct atg tat 2990 Ala Leu Glu Lys Asn Ala Gln Glu Ser Lys Ser Pro Phe Pro Met Tyr 955 960 965 970 tca gag acg gtg gat ctc atc aaa tac act atg aag cta aag aat tac 3038 Ser Glu Thr Val Asp Leu Ile Lys Tyr Thr Met Lys Leu Lys Asn Tyr 975 980 985 ttg gca cca gat gct aca gct gca gat aaa cga ctc aca gta ctt tgc 3086 Leu Ala Pro Asp Ala Thr Ala Ala Asp Lys Arg Leu Thr Val Leu Cys 990 995 1000 ctg cga tgc gag tct ttg ctg tac ctg agg ctg ttc aaa ctg aag aag 3134 Leu Arg Cys Glu Ser Leu Leu Tyr Leu Arg Leu Phe Lys Leu Lys Lys 1005 1010 1015 gaa aat gct ctg aag tac tca aag aca ctg aca gag cac ctg aag aat 3182 Glu Asn Ala Leu Lys Tyr Ser Lys Thr Leu Thr Glu His Leu Lys Asn 1020 1025 1030 tct tat aat aat tct caa gca cca tcg cct ggc ttg gga agc aaa gct 3230 Ser Tyr Asn Asn Ser Gln Ala Pro Ser Pro Gly Leu Gly Ser Lys Ala 1035 1040 1045 1050 gtg ggg atg cct tcc cct gtt tct cca aag ctg tca cca ggc aat tca 3278 Val Gly Met Pro Ser Pro Val Ser Pro Lys Leu Ser Pro Gly Asn Ser 1055 1060 1065 gga aat tat tca tct ggg gcc agt agt gct tct gca agt ggt tct tca 3326 Gly Asn Tyr Ser Ser Gly Ala Ser Ser Ala Ser Ala Ser Gly Ser Ser 1070 1075 1080 gtg acc att cca cag aag atc cac cag atg gca gcc agc tat gtt cag 3374 Val Thr Ile Pro Gln Lys Ile His Gln Met Ala Ala Ser Tyr Val Gln 1085 1090 1095 gtc aca tcc aac ttc ctc tat gcc acc gaa att tgg gac caa gct gaa 3422 Val Thr Ser Asn Phe Leu Tyr Ala Thr Glu Ile Trp Asp Gln Ala Glu 1100 1105 1110 cag ctt tcc aaa gag caa aaa gaa ttc ttt gct gaa ctg gat aaa gta 3470 Gln Leu Ser Lys Glu Gln Lys Glu Phe Phe Ala Glu Leu Asp Lys Val 1115 1120 1125 1130 atg ggc cct ctc atc ttt aat gca agc atc atg aca gat cta gtt cgt 3518 Met Gly Pro Leu Ile Phe Asn Ala Ser Ile Met Thr Asp Leu Val Arg 1135 1140 1145 tat acc cgg cag gga ctg cac tgg ctt cgc cag gat gcc aag ttg ata 3566 Tyr Thr Arg Gln Gly Leu His Trp Leu Arg Gln Asp Ala Lys Leu Ile 1150 1155 1160 tct tga actgaacaca ttctcgttgc ctctgatttt ctccacaaca ctgtgtcaca 3622 Ser * tcacgaagga aaactgccat aacataccac ctagtcgaca ctaagaatga ggaatagttt 3682 tctcctcgtt ggttcatgtg ttgttgtttt tgataatcca aagcgatcat gtcagttggc 3742 cctttaatat ttccaatgtg aaagattatt taaatgcttt taaatctgca gcacattgat 3802 aaaatctcgt gccgaattct tggcctcgag ggccaaat 3840 11 634 DNA Homo sapiens CDS (129)..(515) 11 atttggccct cgaggccaag aattcggcac gaggcgcggt gcgacgaagg agtaggtggt 60 gggatctcac cgtgggtccg attagccttt tctctgcctt gcttgcttga gcttcagcgg 120 aattcgaa atg gct ggc ggt aag gct gga aag gac tcc gga aag gcc aag 170 Met Ala Gly Gly Lys Ala Gly Lys Asp Ser Gly Lys Ala Lys 1 5 10 aca aag gcg gtt tcc cgc tcg cag aga gcc ggc ttg cag ttc cca gtg 218 Thr Lys Ala Val Ser Arg Ser Gln Arg Ala Gly Leu Gln Phe Pro Val 15 20 25 30 ggc cgt att cat cga cac cta aaa tct agg acg acc agt cat gga cgt 266 Gly Arg Ile His Arg His Leu Lys Ser Arg Thr Thr Ser His Gly Arg 35 40 45 gtg ggc gcg act gcc gct gtg tac agc gca gcc atc ctg gag tac ctc 314 Val Gly Ala Thr Ala Ala Val Tyr Ser Ala Ala Ile Leu Glu Tyr Leu 50 55 60 acc gca gag gta ctt gaa ctg gca gga aat gca tca aaa gac tta aag 362 Thr Ala Glu Val Leu Glu Leu Ala Gly Asn Ala Ser Lys Asp Leu Lys 65 70 75 gta aag cgt att acc cct cgt cac ttg caa ctt gct att cgt gga gat 410 Val Lys Arg Ile Thr Pro Arg His Leu Gln Leu Ala Ile Arg Gly Asp 80 85 90 gaa gaa ttg gat tct ctc atc aag gct aca att gct ggt ggt ggt gtc 458 Glu Glu Leu Asp Ser Leu Ile Lys Ala Thr Ile Ala Gly Gly Gly Val 95 100 105 110 att cca cac atc cac aaa tct ctg att ggg aag aaa gga caa cag aag 506 Ile Pro His Ile His Lys Ser Leu Ile Gly Lys Lys Gly Gln Gln Lys 115 120 125 act gtc taa aggatgc ctggattcct tgttatctca ggactctaaa tactctaaca 562 Thr Val * gctgtccagt gttggtgatt ccagtggact gtatctctgt gaaaaacaca attttgcctt 622 tttgtaattc ta 634 12 2752 DNA Homo sapiens CDS (214)..(2082) 12 atcggatccc cttggagacc caagtgatct cgcttttaaa ctcaagcttg gtaccgagct 60 cggatccact agtccagtgt ggtggaattg gacatagaaa aatttcatcg agagtcattc 120 ttctacactc acttgataaa tttcagtgaa acgctgcagc agtgctgtga cctttcgcag 180 ctgtggttcc gagagttctt cctggagctg acc atg ggc agg agg atc cag ttc 234 Met Gly Arg Arg Ile Gln Phe 1 5 ccc att gag atg tcg atg ccc tgg atc ctg acg gac cac atc ctg gag 282 Pro Ile Glu Met Ser Met Pro Trp Ile Leu Thr Asp His Ile Leu Glu 10 15 20 acc aag gag gca tcg atg atg gag tac gtg ctc tac tcc ctg gac ctg 330 Thr Lys Glu Ala Ser Met Met Glu Tyr Val Leu Tyr Ser Leu Asp Leu 25 30 35 tac aat gac agc gcc cac tac gcg ctc acc agg ttc aac aag cag ttc 378 Tyr Asn Asp Ser Ala His Tyr Ala Leu Thr Arg Phe Asn Lys Gln Phe 40 45 50 55 ctg tac gac gaa att gag gcc gag gtg aat cta tgt ttt gac caa ttt 426 Leu Tyr Asp Glu Ile Glu Ala Glu Val Asn Leu Cys Phe Asp Gln Phe 60 65 70 gtt tac aag cta gca gac cag ata ttt gcc tat tat aag gtt atg gca 474 Val Tyr Lys Leu Ala Asp Gln Ile Phe Ala Tyr Tyr Lys Val Met Ala 75 80 85 gga agt ttg ctt ctt gat aaa cgg tta cga tca gaa tgc aag aat cag 522 Gly Ser Leu Leu Leu Asp Lys Arg Leu Arg Ser Glu Cys Lys Asn Gln 90 95 100 gga gcc acg atc cac ctc ccg ccg tct aac cgc tac gag acg ctg ctg 570 Gly Ala Thr Ile His Leu Pro Pro Ser Asn Arg Tyr Glu Thr Leu Leu 105 110 115 aag cag agg cat gtg cag ctc ctc ggc aga tca ata gac ctc aat cgt 618 Lys Gln Arg His Val Gln Leu Leu Gly Arg Ser Ile Asp Leu Asn Arg 120 125 130 135 ctg atc acc cag cgc gtc tca gca gcc atg tat aag tcc cta gaa ctg 666 Leu Ile Thr Gln Arg Val Ser Ala Ala Met Tyr Lys Ser Leu Glu Leu 140 145 150 gcg att gga cga ttt gaa agt gaa gat ttg acc tcc ata gtt gag ctg 714 Ala Ile Gly Arg Phe Glu Ser Glu Asp Leu Thr Ser Ile Val Glu Leu 155 160 165 gat ggc ctg ttg gaa atc aac cgc atg acc cac aag ctg ctg agc cgg 762 Asp Gly Leu Leu Glu Ile Asn Arg Met Thr His Lys Leu Leu Ser Arg 170 175 180 tac ctg acg ctg gac ggc ttc gac gcc atg ttc cgg gag gcc aac cac 810 Tyr Leu Thr Leu Asp Gly Phe Asp Ala Met Phe Arg Glu Ala Asn His 185 190 195 aac gtg tca gcg ccc tac ggg agg atc acc ctg cac gtc ttc tgg gag 858 Asn Val Ser Ala Pro Tyr Gly Arg Ile Thr Leu His Val Phe Trp Glu 200 205 210 215 ctc aac tat gac ttc ctg ccc aac tac tgc tac aac ggc tct acc aac 906 Leu Asn Tyr Asp Phe Leu Pro Asn Tyr Cys Tyr Asn Gly Ser Thr Asn 220 225 230 cgg ttt gtt cgg aca gtg tta cca ttt tct cag gaa ttt caa aga gat 954 Arg Phe Val Arg Thr Val Leu Pro Phe Ser Gln Glu Phe Gln Arg Asp 235 240 245 aag cag cct aat gca cag cct cag tat ctg cat gga tcc aag gct ttg 1002 Lys Gln Pro Asn Ala Gln Pro Gln Tyr Leu His Gly Ser Lys Ala Leu 250 255 260 aac ttg gcc tac tcc agc att tac ggc agc tac cgg aac ttc gtg gga 1050 Asn Leu Ala Tyr Ser Ser Ile Tyr Gly Ser Tyr Arg Asn Phe Val Gly 265 270 275 cct cca cac ttt caa gtc atc tgc cgg ctt ctc ggc tac cag ggt atc 1098 Pro Pro His Phe Gln Val Ile Cys Arg Leu Leu Gly Tyr Gln Gly Ile 280 285 290 295 gcc gtg gtc atg gag gag ctg ctg aag gtc gtc aag agc ctg ctg caa 1146 Ala Val Val Met Glu Glu Leu Leu Lys Val Val Lys Ser Leu Leu Gln 300 305 310 ggc aca atc ctg cag tac gtg aag acg ctg atg gag gtg atg ccc aag 1194 Gly Thr Ile Leu Gln Tyr Val Lys Thr Leu Met Glu Val Met Pro Lys 315 320 325 atc tgc cgc ctg ccc cgg cac gag tac ggc tct cct ggt atc ctg gag 1242 Ile Cys Arg Leu Pro Arg His Glu Tyr Gly Ser Pro Gly Ile Leu Glu 330 335 340 ttc ttc cac cac cag ctg aag gac atc gtg gag tac gca gag ctg aag 1290 Phe Phe His His Gln Leu Lys Asp Ile Val Glu Tyr Ala Glu Leu Lys 345 350 355 acg gtg tgc ttc cag aac ctg cgg gag gtg ggg aac gcc atc ctc ttc 1338 Thr Val Cys Phe Gln Asn Leu Arg Glu Val Gly Asn Ala Ile Leu Phe 360 365 370 375 tgc ctg ctc atc gag cag agc ctg tct tta gaa gaa gtg tgt gac ctg 1386 Cys Leu Leu Ile Glu Gln Ser Leu Ser Leu Glu Glu Val Cys Asp Leu 380 385 390 ctg cac gcg gct cct ttc cag aac atc ttg ccg cga gtc cat gtg aaa 1434 Leu His Ala Ala Pro Phe Gln Asn Ile Leu Pro Arg Val His Val Lys 395 400 405 gag ggg gag aga ctt gat gcc aaa atg aaa aga cta gaa tca aag tac 1482 Glu Gly Glu Arg Leu Asp Ala Lys Met Lys Arg Leu Glu Ser Lys Tyr 410 415 420 gcc ccg ctg cat ctt gtc cca ctg att gaa aga ctg ggg acc cct cag 1530 Ala Pro Leu His Leu Val Pro Leu Ile Glu Arg Leu Gly Thr Pro Gln 425 430 435 caa att gcc atc gca aga gag ggg gac ctg ctg aca aag gag cgc ctc 1578 Gln Ile Ala Ile Ala Arg Glu Gly Asp Leu Leu Thr Lys Glu Arg Leu 440 445 450 455 tgc tgc ggc ctg tcc atg ttt gag gtc atc ctg aca cgg atc cgg agc 1626 Cys Cys Gly Leu Ser Met Phe Glu Val Ile Leu Thr Arg Ile Arg Ser 460 465 470 ttt ctg gat gac ccc atc tgg cgc ggg cct ctg ccc agc aat ggg gtc 1674 Phe Leu Asp Asp Pro Ile Trp Arg Gly Pro Leu Pro Ser Asn Gly Val 475 480 485 atg cat gtg gac gag tgt gtg gag ttt cac aga ctg tgg agt gcc atg 1722 Met His Val Asp Glu Cys Val Glu Phe His Arg Leu Trp Ser Ala Met 490 495 500 cag ttt gtc tac tgc att ccc gtg ggg aca cac gag ttc aca gtc gag 1770 Gln Phe Val Tyr Cys Ile Pro Val Gly Thr His Glu Phe Thr Val Glu 505 510 515 cag tgc ttt ggt gat ggg cta cac tgg gct ggc tgt atg atc atc gta 1818 Gln Cys Phe Gly Asp Gly Leu His Trp Ala Gly Cys Met Ile Ile Val 520 525 530 535 ctt ctt ggg cag cag cgg cgt ttt gct gtg ctg gat ttc tgc tac cat 1866 Leu Leu Gly Gln Gln Arg Arg Phe Ala Val Leu Asp Phe Cys Tyr His 540 545 550 cta ctt aaa gtc cag aaa cat gat ggc aaa gat gag att att aaa aat 1914 Leu Leu Lys Val Gln Lys His Asp Gly Lys Asp Glu Ile Ile Lys Asn 555 560 565 gtg cct ttg aag aag atg gtg gag aga att cgc aag ttc cag att ctc 1962 Val Pro Leu Lys Lys Met Val Glu Arg Ile Arg Lys Phe Gln Ile Leu 570 575 580 aat gat gag atc atc acc atc ctg gat aag tac ctg aag tca ggc gac 2010 Asn Asp Glu Ile Ile Thr Ile Leu Asp Lys Tyr Leu Lys Ser Gly Asp 585 590 595 ggg gag ggc acg cca gtg gag cat gtg cgc tgc ttc cag ccg ccc atc 2058 Gly Glu Gly Thr Pro Val Glu His Val Arg Cys Phe Gln Pro Pro Ile 600 605 610 615 cac cag tcc ctc gcc agc agc tga gggcacgcgc tgcactccgt aactcaacat 2112 His Gln Ser Leu Ala Ser Ser * 620 ggcatgcctt tctctccgta aactatttag tgagattttt agggactatt tttcagtatc 2172 tctgtacctg ttaaaggggg tgcttttcga tctaaaaact taattttata aaattgactt 2232 atttttctag actaaaattg tatatgcttt tggtaattag gaactcttga gaatattggc 2292 tgctgattgt tgccatcacg ttcctacaaa attgtttttc tatgggatgt tctggcagct 2352 gtgtcataaa atgctgctgg gttcattcat tcattccata agaaacttaa taccagcaaa 2412 tgcattaaat cccttgccag ttaccattaa ctgtaactat ttagcttttg tttagggatc 2472 tttctgatgg tcttttatga gcaatcttag ttctaagtca ttgttcccat cccttttttg 2532 tgtgtttcag aaaatagtga acttgattcc cctgcttcca ctaaatccag ttgtgacaaa 2592 atctaacgtg acatcagatc gaaaggttat agaaataaaa ctaatgagat ctaaaaaaaa 2652 tcgacgcggc cgcgaattcg gatcctcgag agatctcttt ttttgggttt ggtggggtat 2712 cttcatcatc gaatagatag ttatatacat gaccttcaat 2752 13 2909 DNA Homo sapiens CDS (86)..(2302) 13 taagcttgcg gccgccgagt cggtctgcgc agcctcctgc gttttctcgc ttggatcttg 60 gcactgagag gcggtggccg gcggg atg gag aaa agt agg atg aac ctg ccc 112 Met Glu Lys Ser Arg Met Asn Leu Pro 1 5 aag ggg ccg gac acg ctc tgc ttc gac aag gac gag ttc atg aag gaa 160 Lys Gly Pro Asp Thr Leu Cys Phe Asp Lys Asp Glu Phe Met Lys Glu 10 15 20 25 gat ttc gat gtc gat cat ttt gtg tct gac tgt agg aag cgg gtc cag 208 Asp Phe Asp Val Asp His Phe Val Ser Asp Cys Arg Lys Arg Val Gln 30 35 40 ctg gaa gaa ctg aga gat gac ctg gag ctc tac tat aaa ctt ctt aaa 256 Leu Glu Glu Leu Arg Asp Asp Leu Glu Leu Tyr Tyr Lys Leu Leu Lys 45 50 55 aca gcc atg gtc gaa ctc atc aac aag gat tat gca gat ttt gtc aat 304 Thr Ala Met Val Glu Leu Ile Asn Lys Asp Tyr Ala Asp Phe Val Asn 60 65 70 ctt tca aca aac ttg gtt ggc atg gac aaa gcc ctc aac cag ctt tct 352 Leu Ser Thr Asn Leu Val Gly Met Asp Lys Ala Leu Asn Gln Leu Ser 75 80 85 gtg cct ttg gga caa tta cga gaa gag gtt ctg agc ctt aga tcg tct 400 Val Pro Leu Gly Gln Leu Arg Glu Glu Val Leu Ser Leu Arg Ser Ser 90 95 100 105 gtc agt gaa gga att cgg gca gtt gat gaa cga atg tct aaa caa gag 448 Val Ser Glu Gly Ile Arg Ala Val Asp Glu Arg Met Ser Lys Gln Glu 110 115 120 gac att agg aaa aaa aag atg tgt gta ttg agg ctt ata caa gtt att 496 Asp Ile Arg Lys Lys Lys Met Cys Val Leu Arg Leu Ile Gln Val Ile 125 130 135 cgg tca gtt gag aaa att gaa aaa atc tta aac tct caa agt tct aaa 544 Arg Ser Val Glu Lys Ile Glu Lys Ile Leu Asn Ser Gln Ser Ser Lys 140 145 150 gaa acc tct gca cta gaa gca agc agc ccc ctt ttg act gga caa att 592 Glu Thr Ser Ala Leu Glu Ala Ser Ser Pro Leu Leu Thr Gly Gln Ile 155 160 165 ttg gag aga att gcc aca gaa ttt aat cag tta cag ttt cat gct gtt 640 Leu Glu Arg Ile Ala Thr Glu Phe Asn Gln Leu Gln Phe His Ala Val 170 175 180 185 caa agc aaa ggc atg cct ctt ttg gac aaa gta aga ccg cgt ata gct 688 Gln Ser Lys Gly Met Pro Leu Leu Asp Lys Val Arg Pro Arg Ile Ala 190 195 200 ggc att aca gcc atg tta cag cag tca ctg gaa ggt ctc cta tta gaa 736 Gly Ile Thr Ala Met Leu Gln Gln Ser Leu Glu Gly Leu Leu Leu Glu 205 210 215 ggc ctt cag acg tct gac gtc gat ata ata cgg cac tgc ttg cgg act 784 Gly Leu Gln Thr Ser Asp Val Asp Ile Ile Arg His Cys Leu Arg Thr 220 225 230 tac gcc acg att gac aag aca cgg gac gcg gag gcc tta gtt ggc caa 832 Tyr Ala Thr Ile Asp Lys Thr Arg Asp Ala Glu Ala Leu Val Gly Gln 235 240 245 gta cta gtg aaa cca tac ata gac gag gtg att ata gag cag ttt gtt 880 Val Leu Val Lys Pro Tyr Ile Asp Glu Val Ile Ile Glu Gln Phe Val 250 255 260 265 gaa tct cat ccc aat ggc ctt cag gtc atg tat aat aaa ctc ctg gag 928 Glu Ser His Pro Asn Gly Leu Gln Val Met Tyr Asn Lys Leu Leu Glu 270 275 280 ttt gtt cct cac cat tgc cgc ctt ctt cga gaa gtc aca gga ggt gcc 976 Phe Val Pro His His Cys Arg Leu Leu Arg Glu Val Thr Gly Gly Ala 285 290 295 atc tcc agt gaa aaa ggc aat act gtt cct gga tat gac ttt ttg gtg 1024 Ile Ser Ser Glu Lys Gly Asn Thr Val Pro Gly Tyr Asp Phe Leu Val 300 305 310 aat tct gtt tgg cca caa ata gta caa gga tta gaa gaa aag tta ccc 1072 Asn Ser Val Trp Pro Gln Ile Val Gln Gly Leu Glu Glu Lys Leu Pro 315 320 325 tcg ctt ttt aat cct ggg aat ccc gat gca ttt cat gag aaa tat acc 1120 Ser Leu Phe Asn Pro Gly Asn Pro Asp Ala Phe His Glu Lys Tyr Thr 330 335 340 345 ata agt atg gat ttt gtc aga aga ttg gaa cgg cag tgt gga tca cag 1168 Ile Ser Met Asp Phe Val Arg Arg Leu Glu Arg Gln Cys Gly Ser Gln 350 355 360 gct agt gta aag aga tta aga gcc cat cct gcc tat cac agc ttc aat 1216 Ala Ser Val Lys Arg Leu Arg Ala His Pro Ala Tyr His Ser Phe Asn 365 370 375 aag aag tgg aac ttg cct gtt tat ttt caa ata aga ttt aga gaa ata 1264 Lys Lys Trp Asn Leu Pro Val Tyr Phe Gln Ile Arg Phe Arg Glu Ile 380 385 390 gcg gga tcc tta gaa gca gca ctt aca gat gtc ctg gaa gat gcc cca 1312 Ala Gly Ser Leu Glu Ala Ala Leu Thr Asp Val Leu Glu Asp Ala Pro 395 400 405 gct gaa agt ccg tat tgc ctt ttg gct tct cat aga act tgg agc agc 1360 Ala Glu Ser Pro Tyr Cys Leu Leu Ala Ser His Arg Thr Trp Ser Ser 410 415 420 425 ctt agg agg tgt tgg tca gat gag atg ttc ttg cca tta ctg gtg cat 1408 Leu Arg Arg Cys Trp Ser Asp Glu Met Phe Leu Pro Leu Leu Val His 430 435 440 cgc ctg tgg aga ctc act ctg cag att ttg gca cga tac tct gtg ttt 1456 Arg Leu Trp Arg Leu Thr Leu Gln Ile Leu Ala Arg Tyr Ser Val Phe 445 450 455 gtc aat gag ctt tca ctc agg ccc att tct aat gaa agt ccc aag gag 1504 Val Asn Glu Leu Ser Leu Arg Pro Ile Ser Asn Glu Ser Pro Lys Glu 460 465 470 atc aag aaa cct ttg gta act ggt agc aaa gaa cct tcc atc acc caa 1552 Ile Lys Lys Pro Leu Val Thr Gly Ser Lys Glu Pro Ser Ile Thr Gln 475 480 485 gga aac act gaa gac caa gga agt ggt cct tcg gaa aca aag cct gtg 1600 Gly Asn Thr Glu Asp Gln Gly Ser Gly Pro Ser Glu Thr Lys Pro Val 490 495 500 505 gtt tcc att tcc cgc act cag ctc gtg tat gtg gtt gca gac ctg gac 1648 Val Ser Ile Ser Arg Thr Gln Leu Val Tyr Val Val Ala Asp Leu Asp 510 515 520 aag ctt cag gag cag ctt cca gaa ctc ttg gaa ata atc aag cca aaa 1696 Lys Leu Gln Glu Gln Leu Pro Glu Leu Leu Glu Ile Ile Lys Pro Lys 525 530 535 ctt gaa atg att ggc ttt aag aat ttt tct tct atc tca gca gcc ctg 1744 Leu Glu Met Ile Gly Phe Lys Asn Phe Ser Ser Ile Ser Ala Ala Leu 540 545 550 gag gac tcc cag agc tct ttt tca gcc tgt gtg ccc tcc ttg agt agc 1792 Glu Asp Ser Gln Ser Ser Phe Ser Ala Cys Val Pro Ser Leu Ser Ser 555 560 565 aag atc atc cag gat tta agt gac tct tgc ttc ggt ttc cta aaa agc 1840 Lys Ile Ile Gln Asp Leu Ser Asp Ser Cys Phe Gly Phe Leu Lys Ser 570 575 580 585 gcc ctg gag gtt ccc agg ctt tac cga aga acc aat aag gag gtc cca 1888 Ala Leu Glu Val Pro Arg Leu Tyr Arg Arg Thr Asn Lys Glu Val Pro 590 595 600 acc aca gct tcc tcc tat gtg gac agt gct ctg aag ccc tta ttc cag 1936 Thr Thr Ala Ser Ser Tyr Val Asp Ser Ala Leu Lys Pro Leu Phe Gln 605 610 615 ctt cag agc gga cac aag gat aag ctc aaa caa gca ata att cag cag 1984 Leu Gln Ser Gly His Lys Asp Lys Leu Lys Gln Ala Ile Ile Gln Gln 620 625 630 tgg cta gaa ggc act ctc agt gaa agc act cat aag tac tat gaa acc 2032 Trp Leu Glu Gly Thr Leu Ser Glu Ser Thr His Lys Tyr Tyr Glu Thr 635 640 645 gtg tca gat gta tta aac tct gtg aag aag atg gaa gag agc ctg aaa 2080 Val Ser Asp Val Leu Asn Ser Val Lys Lys Met Glu Glu Ser Leu Lys 650 655 660 665 agg ctg aaa caa gcc aga aaa acc act ccc gcc aac ccc gtc ggt ccc 2128 Arg Leu Lys Gln Ala Arg Lys Thr Thr Pro Ala Asn Pro Val Gly Pro 670 675 680 agt ggt ggc atg agc gac gac gac aaa atc agg ctg cag ttg gcc cta 2176 Ser Gly Gly Met Ser Asp Asp Asp Lys Ile Arg Leu Gln Leu Ala Leu 685 690 695 gat gtt gag tac ttg gga gag cag ata caa aag ttg gga cta caa gca 2224 Asp Val Glu Tyr Leu Gly Glu Gln Ile Gln Lys Leu Gly Leu Gln Ala 700 705 710 agt gac ata aaa agc ttc tca gct ctc gca gag ctt gtt gct gct gcc 2272 Ser Asp Ile Lys Ser Phe Ser Ala Leu Ala Glu Leu Val Ala Ala Ala 715 720 725 aag gac cag gca aca gca gag cag cct taa g catcttggaa gatcccgagg 2323 Lys Asp Gln Ala Thr Ala Glu Gln Pro * 730 735 ttagattctt aagcaagaga agagttggac ttccaggctg aaggggagaa agtgactctg 2383 ttctcttagc aaccgtctgt agcaaagaag tgcttccagc atcactccag caacacgccc 2443 atgcgtcttc tctcagcgta tttgggtctt ctttgcccaa aagaacacaa aagccttttt 2503 ccattgtatg gaagatagtt tttaagacat ttgaaacttt ctactatagt ttacagaaca 2563 aattatttta tttttattgt aaatcttagt gtggaagagc tgatttctaa aatatgatta 2623 aagtaaatat atacctatga atatcaagag tcgtctccct gagcctgtag ttggaagtga 2683 cgactgtaat ggaatgatgt cttgtataga aattcgaaat gcccttctct gaaataaaga 2743 gaactcctgg gctttctaaa gaggctgcgg gaagccatcc tccactccca ctgtgtgtga 2803 gagcagtgct tctgatcctg ctgtcacccc gacctctggc aggagccggc gccagtagga 2863 aagacctcct tcctaaataa aagaagtgtc tcccaaaaaa aaaaaa 2909 14 817 DNA Homo sapiens CDS (236)..(535) 14 attggaaaac gctggactaa cgtagaccca agctggctag cgtttaaact taagcttggt 60 accgagctcg gatccactag tccagtgtgg tggaattcgg gaagccgagc tgggcgagaa 120 gtaggggagg gcggtgctcc gccgcggtgg cggttgctat cgcttcgcag aacctactca 180 ggcagccagc tgagaagagt tgagggaaag tgctgctgct gggtctgcag acgcg atg 238 Met 1 gat aac gtg cag ccg aaa ata aaa cat cgc ccc ttc tgc ttc agt gtg 286 Asp Asn Val Gln Pro Lys Ile Lys His Arg Pro Phe Cys Phe Ser Val 5 10 15 aaa ggc cac gtg aag atg ctg cgg ctg gat att atc aac tca ctg gta 334 Lys Gly His Val Lys Met Leu Arg Leu Asp Ile Ile Asn Ser Leu Val 20 25 30 aca aca gta ttc atg ctc atc gta tct gtg ttg gca ctg ata cca gaa 382 Thr Thr Val Phe Met Leu Ile Val Ser Val Leu Ala Leu Ile Pro Glu 35 40 45 acc aca aca ttg aca gtt ggt gga ggg gtg ttt gca ctt gtg aca gca 430 Thr Thr Thr Leu Thr Val Gly Gly Gly Val Phe Ala Leu Val Thr Ala 50 55 60 65 gta tgc tgt ctt gcc gac ggg gcc ctt att tac cgg aag ctt ctg ttc 478 Val Cys Cys Leu Ala Asp Gly Ala Leu Ile Tyr Arg Lys Leu Leu Phe 70 75 80 aat ccc agc ggt cct tac cag aaa aag cct gtg cat gaa aaa aaa gaa 526 Asn Pro Ser Gly Pro Tyr Gln Lys Lys Pro Val His Glu Lys Lys Glu 85 90 95 gtt ttg taa ttttata ttacttttta gtttgatact aagtattaaa catatttctg 582 Val Leu * tattcttcca aaaaaaaaaa aaaaaaaggg gggcccgttt tagagatccc ttgggggggc 642 caaagttaac gcgggcgggc gaagaaaaag ctttttcctt ataggggggg gaattataac 702 ctacctggga atttttggaa agaacctttt tttttggggg ggaaaaaatt gggccaacct 762 ccctacagag tttaaagcct ttagggaaaa aataaaattt ttaagggaaa aaggc 817 15 908 DNA Homo sapiens CDS (386)..(844) 15 cgacaggaag actcctcaga agagctggag tgctgccgtc attaacgcca ggccagaaca 60 cgcgggctaa cagggcgctg gcctgcgtgc ccacctctag gtccgctaat atgtgcccag 120 ctctaggtcc gctaatgtgc tacgcgataa ctgaaagaca gcacaagcca gagcggacgg 180 ccgcgggatc ccgttggggc gcatgcgcga gagcgaacgc ggcgccgtgc gcatgcgcgc 240 aagagagcgg gaagccgagc tgggcgagaa gtaggggagg gcggtgctcc gccgcggtgg 300 cggttgctat cgcttcgcag aacctactca ggcagccagc tgagaagagt tgagggaaag 360 tgctgctgct gggtctgcag acgcg atg gat aac gtg cag ccg aaa ata aaa 412 Met Asp Asn Val Gln Pro Lys Ile Lys 1 5 cat cgc ccc ttc tgc ttc agt gtg aaa ggc cac gtg aag atg ctg cgg 460 His Arg Pro Phe Cys Phe Ser Val Lys Gly His Val Lys Met Leu Arg 10 15 20 25 ctg gca cta act gtg aca tct atg acc ttt ttt atc atc gca caa gcc 508 Leu Ala Leu Thr Val Thr Ser Met Thr Phe Phe Ile Ile Ala Gln Ala 30 35 40 cct gaa cca tat att gtt atc act gga ttt gaa gtc acc gtt atc tta 556 Pro Glu Pro Tyr Ile Val Ile Thr Gly Phe Glu Val Thr Val Ile Leu 45 50 55 ttt ttc ata ctt tta tat gta ctc aga ctt gat cga tta atg aag tgg 604 Phe Phe Ile Leu Leu Tyr Val Leu Arg Leu Asp Arg Leu Met Lys Trp 60 65 70 tta ttt tgg cct ttg ctt gat att atc aac tca ctg gta aca aca gta 652 Leu Phe Trp Pro Leu Leu Asp Ile Ile Asn Ser Leu Val Thr Thr Val 75 80 85 ttc atg ctc atc gta tct gtg ttg gca ctg ata cca gaa acc aca aca 700 Phe Met Leu Ile Val Ser Val Leu Ala Leu Ile Pro Glu Thr Thr Thr 90 95 100 105 ttg aca gtt ggt gga ggg gtg ttt gca ctt gtg aca gca gta tgc tgt 748 Leu Thr Val Gly Gly Gly Val Phe Ala Leu Val Thr Ala Val Cys Cys 110 115 120 ctt gcc gac ggg gcc ctt att tac cgg aag ctt ctg ttc aat ccc agc 796 Leu Ala Asp Gly Ala Leu Ile Tyr Arg Lys Leu Leu Phe Asn Pro Ser 125 130 135 ggt cct tac cag aaa aag cct gtg cat gaa aaa aaa gaa gtt ttg taa 844 Gly Pro Tyr Gln Lys Lys Pro Val His Glu Lys Lys Glu Val Leu * 140 145 150 ttttatatta ctttttagtt tgatactaag tattaaacat atttctgtat tcttccacat 904 aaaa 908 16 7619 DNA Homo sapiens CDS (175)..(717) 16 agcggccgcc ggccccggga ctgaccggcc tcgccgcacc tcccgcaccg actagcgctc 60 ccgggcgctc ctgcgcccga ctcgccctcg cccccactcc ccggcggggt ggcggcggcc 120 gggcccccac ggcggcggcc ggagcagcag cagcagcagc aggagcccgc ctct atg 177 Met 1 atg aag ttc aag ccc aac cag acg cgg acc tac gac cgc gag ggc ttc 225 Met Lys Phe Lys Pro Asn Gln Thr Arg Thr Tyr Asp Arg Glu Gly Phe 5 10 15 aag aag cgg gcg gcg tgc ctg tgc ttc cgg agc gag cag gag gac gag 273 Lys Lys Arg Ala Ala Cys Leu Cys Phe Arg Ser Glu Gln Glu Asp Glu 20 25 30 gtg ctg ctg gtg agt agc agc cgg tac cca gac cag tgg att gtc cca 321 Val Leu Leu Val Ser Ser Ser Arg Tyr Pro Asp Gln Trp Ile Val Pro 35 40 45 gga gga gga atg gaa ccc gag gag gaa cct ggc ggt gct gcc gtg agg 369 Gly Gly Gly Met Glu Pro Glu Glu Glu Pro Gly Gly Ala Ala Val Arg 50 55 60 65 gaa gtt tat gag gag gct gga gtc aaa gga aaa cta ggc aga ctt ctg 417 Glu Val Tyr Glu Glu Ala Gly Val Lys Gly Lys Leu Gly Arg Leu Leu 70 75 80 ggc ata ttt gag aac caa gac cga aag cac aga aca tat gtt tat gtt 465 Gly Ile Phe Glu Asn Gln Asp Arg Lys His Arg Thr Tyr Val Tyr Val 85 90 95 cta aca gtc act gaa ata tta gaa gat tgg gaa gat tct gtt aat att 513 Leu Thr Val Thr Glu Ile Leu Glu Asp Trp Glu Asp Ser Val Asn Ile 100 105 110 gga agg aag aga gag tgg ttc aaa gta gaa gat gct atc aaa gtt ctc 561 Gly Arg Lys Arg Glu Trp Phe Lys Val Glu Asp Ala Ile Lys Val Leu 115 120 125 cag tgt cat aaa cct gta cat gca gag tat ctg gaa aag cta aag ctg 609 Gln Cys His Lys Pro Val His Ala Glu Tyr Leu Glu Lys Leu Lys Leu 130 135 140 145 ggt tgt tcc cca gcc aat gga aat tct aca gtc cct tcc ctt ccg gat 657 Gly Cys Ser Pro Ala Asn Gly Asn Ser Thr Val Pro Ser Leu Pro Asp 150 155 160 aat aat gcc ttg ttt gta acc gct gca cag acc tct ggg ttg cca tct 705 Asn Asn Ala Leu Phe Val Thr Ala Ala Gln Thr Ser Gly Leu Pro Ser 165 170 175 agt gta aga tag aga gaactgggta ggcctctccc accatgtgca gtctcatggg 760 Ser Val Arg * 180 gagaggcttc tttcgtttcc tcgtcaaaca tctgattgac gcttgcaaac tgtctgaatt 820 tgccatgcaa ggttttcaaa caatttgcat gtttttcaga tgctttcaaa tcttttttta 880 aaaaaatagt gtaaaatatt ttaataagcc aaagccatgt ggaatttttg tttagatgcc 940 ttaactgtgc cacaccccac aaccccctat attattttgg ttgtctattt ctcacagcat 1000 attttcagtt ttttgtccat ttgacatcag tctgtggttt attttgtcat cagattactt 1060 gtgggtatac ctaccccaaa attgttttct cattcacagc attagcatat tcagcaaatc 1120 catctgtggt gggaattaaa aatattattg gtattaaaga aatccattca ccccaaaact 1180 tgttttacag gattacaatt ttaattcaaa atttccagat ttgggctatt tctgtatgat 1240 ccaataactt attttgtcac agggcttaat ttgccatttt tggggatttg tcgactcatt 1300 ttgtctgaat tttcacaact ggtattatgt cactagctac ctgatacggc tatttccctt 1360 ataactcaat agtaccttaa cacaaagtat aactctgtag agttggtgaa tattttaggg 1420 aaatattagc aaaatgcatg tagtaaagac atcttatgaa aactgtattc atggaatttg 1480 atttagcatg ctcagttgcc agttcccatt atcgatactc tttctttgca gaatacctta 1540 gaaccgttat ttccctcagt gtagattgct cttaaaatgt attcagttat ttagtggccc 1600 ccacaggagt ggagtcttga aatctaattc taaatgccag tcagtgatga tcgcatcaca 1660 gttgagtgaa atggccttcc tgttcagctg ttagagactg aagattgtta gggcacctta 1720 gaatgtctca tcttttctag gttgtcaaca ggtactattt gtcacataac taactttcga 1780 ggcactggaa catacctgaa ctaagaatta aatcttttac tttatactca cttaaaatca 1840 agaatcccat ctaaaacaca taggtacctt atctgaaact cttgcacttc cccaaccagg 1900 gcagaaatga ggtgggagaa gtttgactaa aatgagggat gggggaaagt aaaagatgtt 1960 tttttttttt tgagactcgc tttgtcaccc aggctggagt gcaatggcac aatctcaact 2020 caccgcaacc tccgcctccc gggttcaggc gattctcctg cctcagcctc ccgagtagtt 2080 gggattacag gcgcctgcct ccatgcctgg ctaattttgt atttttagta gagacagggt 2140 ttcttcatgt tggtcaggct ggtctcaaac tcctaacctc gtgatccgcc tgcctcgacc 2200 tcccaaagtg ctgggattac aggcatgagc caccatgccc agccaaagat cattttttta 2260 tatagacttc agccctttgt aaatattgta actggggagt atagagtaga aaaaaagtat 2320 agttaaaaca tttgttctac aaattaacct ttaaaaatat aattactgct aaaaatagag 2380 tgctgttaca cttaaggaaa attagtgcca ttttggaaat gagatcttgt gccataaata 2440 cagctgaact gaatataaat gttcacaaat taatgctgtc aaaggaatga gtaaagcaga 2500 aaaactttta accagcaaca tttcaagtac gtagtgtgat caaacatgat catccagaat 2560 ttttatattt ttttctttgt acagaggtta catattctgg gtgttctttt ataaaggaaa 2620 cattttaaat cccacaaatt gacactttct atcttcaatg gactaagatt tttttggtca 2680 gtaatctctg aaatttcctt aaattatata cttaacatag cagagaaact ggattgtttt 2740 tgtatataag actgcttcca cctgaaatgc tgtcataagt actggggggg tgggggagtg 2800 cccatcttgt acatgatatt cttagaggta ataaggtaat gatgcatgaa tttattttat 2860 aaactcttgg actatgtatt tgacatgtaa aatatgtaca gtattaacgt caaccatctc 2920 tttaaagtta gcctataaat attgttgtat aattttcttt ggtcagaaac acttggactg 2980 aatagtgaca ctcggtaagg attttcagtg ccatttagca aaacgtcttt agtctatgca 3040 actagcaaaa atctgcataa tgcaacaaca cagttttcac acttcaaatt ttacatgagg 3100 ggttcttcac gattctcctg gaacccagtc tagcaaaatg agaggaaagg ccctagggag 3160 atttatttct gttaaatagt taactctgct gagtatgaca aatagtgtct actatctcaa 3220 ccctccaaaa tttgcagagt gttgggactg tcatttgtga tttttttttt tttttttggt 3280 ggggtagttg aataaaattg ggcagctaaa tttttcagtt ccatgtgcct cccaaataaa 3340 aaacagaaaa taagtagttt ttgtgaattg acttgcagac aaagtagaaa ctgtgctgca 3400 tgatgttatt ttgtaacaga tgacattttc tgaccaggca catgccatcc aattttctgt 3460 caatcacact gttgtataaa gcagcagaac tgaaggggaa aaatgattgt tgtatacact 3520 gaattgcttt gcatggtctc atttgagata attgatgtaa gaatcctgac ttttttatat 3580 ttggaaacat caaataaaaa tggaaaaaat gatcatggct ttaaaaaaaa aaacaaaaac 3640 acacacacat gaactcagat ttgcaaacca ggtttctgaa actttgggta aggtgtatgc 3700 ttttaacttt gaaatgtata aacacccagc ggaaacgtat acctttcagt gcatagtgca 3760 ttgtgtgctt accctgggtt gtttaaactt gtgttcaaat attccccctc caagtgaaag 3820 cctgtcggga attccacagt aacaccttta ctgttctccc attgatgatc atatacgtta 3880 cctcttccta gcggtacatt tgtatgatcc ttactactgg aattaccggg ttaaaaggaa 3940 atgcttacca ctaagatgtc agtattctcc tcatggatat tttcagtctc aatgttgcca 4000 gtctaatagt atagtatgtg gttgctttac tgctgttctc cccacccccg tggagttgtg 4060 tcattatttt aatgaatgtg agctcttgac ttactctaga attctaatac agatactttc 4120 tgcactagta tcacttttat tattgcagtt ccatttaaaa gcataactgg ctaagtcacc 4180 gccccacccg ccgccattac atttcctaaa acatactgct gccaaagacc aactgtagaa 4240 tccttaagtc ctgttcaagt gtcacttcat tttttattag gggtttctta gttggagggc 4300 atgagaaact tggccctttc caatgttgtc ttcacattgg agtataagaa ttctctatag 4360 gactctttta attttttttt tttttttggt agagatgggg tcttgctgtg ttgcccaggc 4420 tggtctcaaa ctcctaggct taagcagtcc tccagcctca gccttctcaa agtgctagga 4480 tgattggcat gagccacctc gtctggcctg caggtctttt taagcatttg ctgttcaata 4540 ataagcatct tctttatgcc tagtatttgt gataagagca caaaacaatt ttctaatctc 4600 aaagagctta tattctagtg gggtagagat aagtgattta tcttggtgta gttttggctt 4660 ttttgtttgt ttttttgaga cagcatctca cactgtcgcc caggctggag tgcagtggtg 4720 caatctcagc ttactgtaac ctccgcctgc caggttcaag cgattctcct gcctcatcct 4780 cccgaggagc tgggatcaca ggcgcctgcc accacgcccg gctaattttt ttgtattttt 4840 agtagagacg gggtttcacc atattggcca ggctggtctc aaactcctga cctcgtgatc 4900 cgcctgcctc ggcctcccta agtgctggga ttatgggcgt gaaccaccgc acctggccta 4960 tcttggtgtt tttacatcct tcagtatttt tcaaggtttt acagcttttt tatctataga 5020 cttataccat cttcaaaagc aatacaattt tatttgtaga cgtgatttcc cccaccaaat 5080 gcacatttta tggagctcta ggacaaaagt caggatgaaa agtaatctat cgctggtgct 5140 actttaatac cttcctgact ggtgccactg gctaaattct agcagtacga tgagagggcc 5200 attaaggcct atcttagcaa gcaagacatt atagcaaaga cctaaacact caaaggttca 5260 acgtcttcta gccagggaac tattctctaa ttcttccacc ttgatatgat ttcagcaaat 5320 tctgataacc cggtattact cttaatgcat ttttgtaaca tttgacaaac atctcccaat 5380 atgtagactc ccactctcct gatgctaatc agtatcagac aatggaagta aattttcctg 5440 cttttctcaa cttttcctca aattcatgtt agtgaagtac tttcatttgg ccatcattat 5500 ttatcaacct taagaaacat gcctattgac gaagtaaata tactaggaat tcaacgtatc 5560 tacgggaatg tggacaaaga catataccaa gacaaggcac tagagtgaaa agccattaaa 5620 ataaaatgct cagcagcaaa ggatttgtaa tggttaactt gcaatatatc catatggtgt 5680 aatattacag tcattagaaa tgacatttgc gtaaggatct gagtggaaac tgatacagcc 5740 tgtcggagag ctactgagta gtattttatc acagctgcat acataccctt tcacccaaca 5800 gttccactcc tgggagtcta gtcaacaaat ggacaagttg cacaaccaag ctatggtcaa 5860 atatgtgtgt tgtaggattt atgaaactgg taccaattta atattcaaat gtgattaagt 5920 catggcaatc tatttactgg atactttgtg gttaccaaag agctatgtgc tgccacaggg 5980 agtcctttca atagtgattg aaaaaggcag aacactacta gacagaacta cctgttttat 6040 gcatgtattg gcatacactg aaaaagctta gtcacttaaa atgaggttgt acagattgta 6100 gattttgttt ttaaacgcct gtatggttta aaaaatgttt tgggaaatag acatgagcag 6160 ggcagagaac tacaacctac aggccaaacc ccaccccact gcctgtttgt acggccaaca 6220 agtttaaaat ttttttaatg ttttaggttt tttttttttt aaaaaaaata ctatgcccat 6280 ttgtttacaa atcgtctgtg ggtgctttct cactacaatg gcagaattca gtagttgaga 6340 cagacctatg gtccacaaag gccgaaatat ttactgtttt aggagaaaaa aacaacatga 6400 attgtaataa cttggactta cttggttgct cagtcctgat gtgtcctgtt aaaacctaag 6460 agaaacagca ccaagttcaa tctagtgcaa tcagcctatc acatctaagc tgcttttgcc 6520 agatctgata tctactcccg accttaagcc aggagacttt gtaaacttac taggttataa 6580 taaaacttaa gaactatcga tatttactaa tctacatcat ggaactattt caaaagcagt 6640 catatggtaa gatggatcag gagtctttaa aaataaaact gagaaagaaa aatcatgtga 6700 tcaattcagt agtagagaca gaggttggta tcaagtctat gttcatacac ttaccagtgc 6760 cacccagcag ggccatacaa gacacatgaa atgagatcag ctcaagggtg acaagatgaa 6820 actgggagag ggccggggag acccgaactc ttaatattgg attccacaca agaggaaaaa 6880 ttttagtcca gctcccagca tggcagctaa tggcaagtta ctcagtacca taaagccaac 6940 tcttggaata tacctgactt ccacgataaa atggagatga gtgcaggggt gagtgtatag 7000 ttaagaaaac aaacaatata ttactagtcc aggtggtaga tgacagattt ttattattta 7060 cacttctctc aaattattac attcagcatg ttttgctttt tatgttattt aaagtataaa 7120 ccctcatata attccctaag atgggtggca tgccaagtag ctaaatgaaa tatcatccca 7180 gcccaaaagt acttattcga atgaaatatt acatttttct taaataaagc aataaattag 7240 gtaccctatt atcatggtat tttctttttt ggccagcttt tctagataag gttgtattgc 7300 tactgcaact aacaaaaaag atgtggcagg gctgggcacg gtggctcacg cctgtaatcc 7360 cagcactttg ggaggccgag gcgggcagat cacgaggtca ggagatggag accatcctgg 7420 cccaacatgg tgaaaccccg tctctactaa aaatacaaaa aattggccag gtgtggtggc 7480 aggtgtctgt agtcccagct cctcgggagg ctgaggcagg agaatcgctt gaacgcggga 7540 ggcagaacct gcagtgagcc aagatcatgc cacagcactc cagcctaaac tacagagcta 7600 gaaagtggct agagcagcc 7619 17 1209 DNA Homo sapiens CDS (196)..(708) 17 atttgaagcc tgctgtaccg ctctcgaatt cccgggtcga cccacgcgtc cgggggccgt 60 gtgcagaccc gcgtgtggcg caggcaagga ccctcaaaat aaacagcctc taccttgcga 120 gccgtcttcc ccaggcctgc gtccgagtct ccgccgctgc gggcccgctc cgacgcggaa 180 gatctgactg cagcc atg agc agc aat gag tgc ttc aag tgt gga cga tct 231 Met Ser Ser Asn Glu Cys Phe Lys Cys Gly Arg Ser 1 5 10 ggc cac tgg gcc cgg gaa tgt cct act ggt gga ggc cgt ggt cgt gga 279 Gly His Trp Ala Arg Glu Cys Pro Thr Gly Gly Gly Arg Gly Arg Gly 15 20 25 atg aga agc cgt ggc aga ggt ttc cag ttt gtt tcc tcg tct ctt cca 327 Met Arg Ser Arg Gly Arg Gly Phe Gln Phe Val Ser Ser Ser Leu Pro 30 35 40 gat att tgt tat cgc tgt ggt gag tct ggt cat ctt gcc aag gat tgt 375 Asp Ile Cys Tyr Arg Cys Gly Glu Ser Gly His Leu Ala Lys Asp Cys 45 50 55 60 gat ctt cag gag gat gcc tgc tat aac tgc ggt aga ggt ggc cac att 423 Asp Leu Gln Glu Asp Ala Cys Tyr Asn Cys Gly Arg Gly Gly His Ile 65 70 75 gcc aag gac tgc aag gag ccc aag aga gag cga gag caa tgc tgc tac 471 Ala Lys Asp Cys Lys Glu Pro Lys Arg Glu Arg Glu Gln Cys Cys Tyr 80 85 90 aac tgt ggc aaa cca ggc cat ctg gct cgt gac tgc gac cat gca gat 519 Asn Cys Gly Lys Pro Gly His Leu Ala Arg Asp Cys Asp His Ala Asp 95 100 105 gag cag aaa tgc tat tct tgt gga gaa ttc gga cac att caa aaa gac 567 Glu Gln Lys Cys Tyr Ser Cys Gly Glu Phe Gly His Ile Gln Lys Asp 110 115 120 tgc acc aaa gtg aag tgc tat agg tgt ggt gaa act ggt cat gta gcc 615 Cys Thr Lys Val Lys Cys Tyr Arg Cys Gly Glu Thr Gly His Val Ala 125 130 135 140 atc aac tgc agc aag aca agt gaa gtc aac tgt tac cgc tgt ggc gag 663 Ile Asn Cys Ser Lys Thr Ser Glu Val Asn Cys Tyr Arg Cys Gly Glu 145 150 155 tca ggg cac ctt gca cgg gaa tgc aca att gag gct aca gcc taa tta 711 Ser Gly His Leu Ala Arg Glu Cys Thr Ile Glu Ala Thr Ala * 160 165 170 ttttcctttg tcgcccctcc tttttctgat tgatggttgt attattttct ctgaatcctc 771 ttcactggcc aaaggttggc agatagaggc aactcccagg ccagtgagct ttacttgccg 831 tgtaaaagga ggaaaggggt ggaaaaaaac cgcctttctg catttaacta caaaaaaagt 891 ttatgtttag tttggtagag gtgttatgta taatgctttg ttaaagaacc ccctttccgt 951 gccactggtg aatagggatt gatgaatggg aaaagttgag tcagaccagt aagcccgtcc 1011 tgggttcctt gaacatgttc ccatgtagga ggtaaaacca attttggaag tgtttatgaa 1071 cttccataaa taactttaat tttagtataa tgatggtctt ggattgtttg ccctcagtag 1131 ctattaaata acatcaagta acatttgtat caggccctac atagaacatc cagttgagtg 1191 agagttccca aaaaaaaa 1209 18 2029 DNA Homo sapiens CDS (186)..(719) 18 attgatcgcc tgtggtaccg gcccggaatt cccgggtcga cccacgcgtc cggcagaccc 60 gcgtgtggcg caggcaagga ccctcaaaat aaacagcctc taccttgcga gccgtcttcc 120 ccaggcctgc gtccgagtct ccgccgctgc gggcccgctc cgacgcggaa gatctgactg 180 cagcc atg agc agc aat gag tgc ttc aag tgt gga cga tct ggc cac 227 Met Ser Ser Asn Glu Cys Phe Lys Cys Gly Arg Ser Gly His 1 5 10 tgg gcc cgg gaa tgt cct act ggt gga ggc cgt ggt cgt gga atg aga 275 Trp Ala Arg Glu Cys Pro Thr Gly Gly Gly Arg Gly Arg Gly Met Arg 15 20 25 30 agc cgt ggc aga ggt ggt ttt acc tcg gat aga ggt ttc cag ttt gtt 323 Ser Arg Gly Arg Gly Gly Phe Thr Ser Asp Arg Gly Phe Gln Phe Val 35 40 45 tcc tcg tct ctt cca gat att tgt tat cgc tgt ggt gag tct ggt cat 371 Ser Ser Ser Leu Pro Asp Ile Cys Tyr Arg Cys Gly Glu Ser Gly His 50 55 60 ctt gcc aag gat tgt gat ctt cag gag gat gcc tgc tat aac tgc ggt 419 Leu Ala Lys Asp Cys Asp Leu Gln Glu Asp Ala Cys Tyr Asn Cys Gly 65 70 75 aga ggt ggc cac att gcc aag gac tgc aag gag ccc aag aga gag cga 467 Arg Gly Gly His Ile Ala Lys Asp Cys Lys Glu Pro Lys Arg Glu Arg 80 85 90 gag caa tgc tgc tac aac tgt ggc aaa cca ggc cat ctg gct cgt gac 515 Glu Gln Cys Cys Tyr Asn Cys Gly Lys Pro Gly His Leu Ala Arg Asp 95 100 105 110 tgc gac cat gca gat gag cag aaa tgc tat tct tgt gga gaa atc gga 563 Cys Asp His Ala Asp Glu Gln Lys Cys Tyr Ser Cys Gly Glu Ile Gly 115 120 125 cac att caa aaa gac tgc acc aaa gtg aag tgc tat agg tgt ggt gaa 611 His Ile Gln Lys Asp Cys Thr Lys Val Lys Cys Tyr Arg Cys Gly Glu 130 135 140 act ggt cat gta gcc atc aac tgc agc aag aca agt gaa gtc aac tgt 659 Thr Gly His Val Ala Ile Asn Cys Ser Lys Thr Ser Glu Val Asn Cys 145 150 155 tac cgc tgt ggc gag tca ggg cac ctt gca cgg gaa tgc aca att gag 707 Tyr Arg Cys Gly Glu Ser Gly His Leu Ala Arg Glu Cys Thr Ile Glu 160 165 170 gct aca gcc taa tta ttttcctttg tcgcccctcc tttttctgat tgatggttgt 762 Ala Thr Ala * 175 attattttct ctgaatcctc ttcactggcc aaaggttggc agatagaggc aactcccagg 822 ccagtgagct ttacttgccg tgtaaaagga ggaaaggggt ggaaaaaaac cgactttctg 882 catttaacta caaaaaaagt ttatgtttag tttggtagag gtgttatgta taatgctttg 942 ttaaagaacc ccctttccgt gccactggtg aatagggatt gatgaatggg aagagttgag 1002 tcagaccagt aagcccgtcc tgggttcctt gaacatgttc ccatgtagga ggtaaaacca 1062 attctggaag tgtctatgaa cttccataaa taactttaat tttagtataa tgatggtctt 1122 ggattgtctg acctcagtag ctattaaata acatcaagta acatctgtat caggccctac 1182 atagaacata cagttgagtg ggagtaaaca aaaagataaa catgcgtgtt aatggctgtt 1242 cgagagaaat cggaataaaa gcctaaacag gaacaacttc atcacagtgt tgatgttgga 1302 cacatagatg gtgatggcaa aggtttagaa cacattattt tcaaagacta aatctaaaac 1362 ccagagtaaa catcaatgct cagagttagc ataatttgga gctattcagg aattgcagag 1422 aaatgcattt tcacagaaat caagatgtta tttttgtata ctatatcact tagacaactg 1482 tgtttcattt gctgtaatca gtttttaaaa gtcagatgga aagagcaact gaagtcctag 1542 aaaatagaaa tgtaatttta aactattcca ataaagctgg aggaggaagg ggagtttgac 1602 taaagttctt tttgtttgtt tcaaattttc attaatgtat atagtgcaaa ataccatatt 1662 aaagagggga atgtggagga ctgaaagctg acagtttgga cttttctttt tgtacttaag 1722 tcatgtcttc aataatgaaa attgctgtta aaaggatgta tgggatttag atacttttgc 1782 aaagctatag aaaattcact ttgtaatctg ttataataat gcccttgagt tctgtgttca 1842 gtctgaacag gttttttggt ggtggcgcct tattccttag tgtttctaaa aaaactttgg 1902 gagagaaaat aaacaaactc ttgacttaat ttttaatttt gaagtgcaaa aagcagcttc 1962 ggctttcaaa gaaagtgatt ttaatttgct gaccaaccga ccctgtgtga aataactttc 2022 ccttttt 2029 19 895 DNA Homo sapiens CDS (46)..(840) 19 atttggccct cgaggccaag aattcggcac gagctgacca gcacc atg gcg gtt 54 Met Ala Val 1 ggc aag aac aag cgc ctt acg aaa ggc ggc aaa aag gga gcc aag aag 102 Gly Lys Asn Lys Arg Leu Thr Lys Gly Gly Lys Lys Gly Ala Lys Lys 5 10 15 aaa gtg gtt gat cca ttt tct aag aaa gat tgg tat gat gtg aaa gca 150 Lys Val Val Asp Pro Phe Ser Lys Lys Asp Trp Tyr Asp Val Lys Ala 20 25 30 35 cct gct atg ttc aat ata aga aat att gga aag acg ctc gtc acc agg 198 Pro Ala Met Phe Asn Ile Arg Asn Ile Gly Lys Thr Leu Val Thr Arg 40 45 50 acc caa gga acc aaa att gca tct gat ggt ctc aag ggt cgt gtg ttt 246 Thr Gln Gly Thr Lys Ile Ala Ser Asp Gly Leu Lys Gly Arg Val Phe 55 60 65 gaa gtg agt ctt gct gat ttg cag aat gat gaa gtt gca ttt aga aaa 294 Glu Val Ser Leu Ala Asp Leu Gln Asn Asp Glu Val Ala Phe Arg Lys 70 75 80 ttc aag ctg att act gaa gat gtt cag ggt aaa aac tgc ctg act aac 342 Phe Lys Leu Ile Thr Glu Asp Val Gln Gly Lys Asn Cys Leu Thr Asn 85 90 95 ttc cat ggc atg gat ctt acc cgt gac aaa atg tgt tcc atg gtc aaa 390 Phe His Gly Met Asp Leu Thr Arg Asp Lys Met Cys Ser Met Val Lys 100 105 110 115 aaa tgg cag aca atg att gaa gct cac gtt gat gtc aag act acc gat 438 Lys Trp Gln Thr Met Ile Glu Ala His Val Asp Val Lys Thr Thr Asp 120 125 130 ggt tac ttg ctt cgt ctg ttc tgt gtt ggt ttt act aaa aaa cgc aac 486 Gly Tyr Leu Leu Arg Leu Phe Cys Val Gly Phe Thr Lys Lys Arg Asn 135 140 145 aat cag ata cgg aag acc tct tat gct cag cac caa cag gtc cgc caa 534 Asn Gln Ile Arg Lys Thr Ser Tyr Ala Gln His Gln Gln Val Arg Gln 150 155 160 atc cgg aag aag atg atg gaa atc atg acc cga gag gtg cag aca aat 582 Ile Arg Lys Lys Met Met Glu Ile Met Thr Arg Glu Val Gln Thr Asn 165 170 175 gac ttg aaa gaa gtg gtc aat aaa ttg att cca gac agc att gga aaa 630 Asp Leu Lys Glu Val Val Asn Lys Leu Ile Pro Asp Ser Ile Gly Lys 180 185 190 195 gac ata gaa aag gct tgc caa tct att tat cct ctc cat gat gtc ttc 678 Asp Ile Glu Lys Ala Cys Gln Ser Ile Tyr Pro Leu His Asp Val Phe 200 205 210 gtt aga aaa gta aaa atg ctg aag aag ccc aag ttt gaa ttg gga aag 726 Val Arg Lys Val Lys Met Leu Lys Lys Pro Lys Phe Glu Leu Gly Lys 215 220 225 ctc atg gag ctt cat ggt gaa ggc agt agt tct gga aaa gcc act ggg 774 Leu Met Glu Leu His Gly Glu Gly Ser Ser Ser Gly Lys Ala Thr Gly 230 235 240 gac gag aca ggt gct aaa gtt gaa cga gct gat gga tat gaa cca cca 822 Asp Glu Thr Gly Ala Lys Val Glu Arg Ala Asp Gly Tyr Glu Pro Pro 245 250 255 gtc caa gaa tct gtt taa agttca gacttcaaat agtggcaaat aaaaagtgct 876 Val Gln Glu Ser Val * 260 atttgtgaaa aaaaaaaaa 895 20 1196 DNA Homo sapiens CDS (97)..(1101) 20 gtccagtgtg gtggaattcc gtcgtctcta tggtggcggc ggatttggag ggaccctacg 60 aaccaggagt caggcgagcc gatctggggc tgcagg atg ttc cgc tgg gag cgc 114 Met Phe Arg Trp Glu Arg 1 5 tcc att ccc ctg cga ggc tcg gcc gcc gcc ctg tgc aac aac ctc agt 162 Ser Ile Pro Leu Arg Gly Ser Ala Ala Ala Leu Cys Asn Asn Leu Ser 10 15 20 gtg ctg cag ctg ccg gct cgc aac ctc acg tat ttt ggc gtg gtt cat 210 Val Leu Gln Leu Pro Ala Arg Asn Leu Thr Tyr Phe Gly Val Val His 25 30 35 gga cca agc gcc cag ctt ctc agc gct gct cct gag ggt gtg ccc ttg 258 Gly Pro Ser Ala Gln Leu Leu Ser Ala Ala Pro Glu Gly Val Pro Leu 40 45 50 gcc cag cgc cag ctc cac gct aag gag ggt gct gga gtg agt ccc cca 306 Ala Gln Arg Gln Leu His Ala Lys Glu Gly Ala Gly Val Ser Pro Pro 55 60 65 70 ctt atc act cag gtc cac tgg tgt gtc ctc ccc ttc cga gtg ctg ctg 354 Leu Ile Thr Gln Val His Trp Cys Val Leu Pro Phe Arg Val Leu Leu 75 80 85 gta ctc acc tca cat cga gga ata cag atg tac gag tcc aat ggc tac 402 Val Leu Thr Ser His Arg Gly Ile Gln Met Tyr Glu Ser Asn Gly Tyr 90 95 100 acc atg gtc tac tgg cat gca ctg gac tct gga gat gcc tcc cca gta 450 Thr Met Val Tyr Trp His Ala Leu Asp Ser Gly Asp Ala Ser Pro Val 105 110 115 cag gct gtg ttt gcc cgg gga att gct gcc agt ggc cac ttc atc tgt 498 Gln Ala Val Phe Ala Arg Gly Ile Ala Ala Ser Gly His Phe Ile Cys 120 125 130 gtg gga acg tgg tca ggc cgg gtg ctg gtg ttt gac atc cca gca aag 546 Val Gly Thr Trp Ser Gly Arg Val Leu Val Phe Asp Ile Pro Ala Lys 135 140 145 150 ggt ccc aac att gta ctg agc gag gag ctg gct ggg cac cag atg cca 594 Gly Pro Asn Ile Val Leu Ser Glu Glu Leu Ala Gly His Gln Met Pro 155 160 165 atc aca gac att gcc acc gag cct gcc cag gga cag gat tgt gtg gct 642 Ile Thr Asp Ile Ala Thr Glu Pro Ala Gln Gly Gln Asp Cys Val Ala 170 175 180 gac atg gtg acg gca gat gac tca ggc ttg ctg tgt gtc tgg cgg tca 690 Asp Met Val Thr Ala Asp Asp Ser Gly Leu Leu Cys Val Trp Arg Ser 185 190 195 ggg cca gaa ttc aca tta ttg acc cgc att cca gga ttt gga gtt ccg 738 Gly Pro Glu Phe Thr Leu Leu Thr Arg Ile Pro Gly Phe Gly Val Pro 200 205 210 tgc ccc tct gtg cag ctg tgg cag ggg atc ata gca gca ggc tat ggg 786 Cys Pro Ser Val Gln Leu Trp Gln Gly Ile Ile Ala Ala Gly Tyr Gly 215 220 225 230 aac gga caa gtg cat cta tat gag gcc act aca gga aat cta cat gtc 834 Asn Gly Gln Val His Leu Tyr Glu Ala Thr Thr Gly Asn Leu His Val 235 240 245 cag atc aat gcc cat gcc cgg gcc atc tgc gcc ctg gac ctg gct ttt 882 Gln Ile Asn Ala His Ala Arg Ala Ile Cys Ala Leu Asp Leu Ala Phe 250 255 260 gag gtg ggc aag cta ctc tct gca ggt gag gac acc ttt gtg cat atc 930 Glu Val Gly Lys Leu Leu Ser Ala Gly Glu Asp Thr Phe Val His Ile 265 270 275 tgg aag ctg agc aga aac cca gag agt ggc tac att gag gtg gaa cac 978 Trp Lys Leu Ser Arg Asn Pro Glu Ser Gly Tyr Ile Glu Val Glu His 280 285 290 tgt cat ggt gag tgt gtc gcc gac acc cag ctg tgt ggt gct cga ttt 1026 Cys His Gly Glu Cys Val Ala Asp Thr Gln Leu Cys Gly Ala Arg Phe 295 300 305 310 tgt gat tcc tca ggc aac tcc ttt gct gtg act ggc tat gac ctt gcg 1074 Cys Asp Ser Ser Gly Asn Ser Phe Ala Val Thr Gly Tyr Asp Leu Ala 315 320 325 gag atc cgg aga ttc agc agt gtg tga gaaga gcagccttcc tttgtccctg 1126 Glu Ile Arg Arg Phe Ser Ser Val * 330 tggtattcat aaagtacccg ctccccccaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1186 aaaaaaaaaa 1196 21 1042 DNA Homo sapiens CDS (97)..(978) 21 gtccagtgtg gtggaattcc gtcgtctcta tggtggcggc ggatttggag ggaccctacg 60 aaccaggagt caggcgagcc gatctggggc tgcagg atg ttc cgc tgg gag cgc 114 Met Phe Arg Trp Glu Arg 1 5 tcc att ccc ctg cga ggc tcg gcc gcc gcc ctg tgc aac aac ctc agt 162 Ser Ile Pro Leu Arg Gly Ser Ala Ala Ala Leu Cys Asn Asn Leu Ser 10 15 20 gtg ctg cag ctg ccg gct cgc aac ctc acg tat ttt ggc gtg gtt cat 210 Val Leu Gln Leu Pro Ala Arg Asn Leu Thr Tyr Phe Gly Val Val His 25 30 35 gga cca agc gcc cag ctt ctc agc gct gct cct gag ggt gtg ccc ttg 258 Gly Pro Ser Ala Gln Leu Leu Ser Ala Ala Pro Glu Gly Val Pro Leu 40 45 50 gcc cag cgc cag ctc cac gct aag gag ggt gct gga gtg agt ccc cca 306 Ala Gln Arg Gln Leu His Ala Lys Glu Gly Ala Gly Val Ser Pro Pro 55 60 65 70 ctt atc act cag gtc cac tgg tgt gtc ctc ccc ttc cga gtg ctg ctg 354 Leu Ile Thr Gln Val His Trp Cys Val Leu Pro Phe Arg Val Leu Leu 75 80 85 gta ctc acc tca cat cga gga ata cag atg tac gag tcc aat ggc tac 402 Val Leu Thr Ser His Arg Gly Ile Gln Met Tyr Glu Ser Asn Gly Tyr 90 95 100 acc atg gtc tac tgg cat gca ctg gac tct gga gat gcc tcc cca gta 450 Thr Met Val Tyr Trp His Ala Leu Asp Ser Gly Asp Ala Ser Pro Val 105 110 115 cag gct gtg ttt gcc cgg gga att gct gcc agt ggc cac ttc atc tgt 498 Gln Ala Val Phe Ala Arg Gly Ile Ala Ala Ser Gly His Phe Ile Cys 120 125 130 gtg gga acg tgg tca ggc cgg gtg ctg gtg ttt gac atc cca gca aag 546 Val Gly Thr Trp Ser Gly Arg Val Leu Val Phe Asp Ile Pro Ala Lys 135 140 145 150 ggt ccc aac att gta ctg agc gag gag ctg gct ggg cac cag atg cca 594 Gly Pro Asn Ile Val Leu Ser Glu Glu Leu Ala Gly His Gln Met Pro 155 160 165 atc aca gac att gcc acc gag cct gcc cag gga cag ctg tgg cag ggg 642 Ile Thr Asp Ile Ala Thr Glu Pro Ala Gln Gly Gln Leu Trp Gln Gly 170 175 180 atc ata gca gca ggc tat ggg aac gga caa gtg cat cta tat gag gcc 690 Ile Ile Ala Ala Gly Tyr Gly Asn Gly Gln Val His Leu Tyr Glu Ala 185 190 195 act aca gga aat cta cat gtc cag atc aat gcc cat gcc cgg gcc atc 738 Thr Thr Gly Asn Leu His Val Gln Ile Asn Ala His Ala Arg Ala Ile 200 205 210 tgc gcc ctg gac ctg gct ttt gag gtg ggc aag cta ctc tct gca ggt 786 Cys Ala Leu Asp Leu Ala Phe Glu Val Gly Lys Leu Leu Ser Ala Gly 215 220 225 230 gag gac acc ttt gtg cat atc tgg aag ctg agc aga aac cca gag agt 834 Glu Asp Thr Phe Val His Ile Trp Lys Leu Ser Arg Asn Pro Glu Ser 235 240 245 ggc tac att gag gtg gaa cac tgt cat ggt gag tgt gtc gcc gac acc 882 Gly Tyr Ile Glu Val Glu His Cys His Gly Glu Cys Val Ala Asp Thr 250 255 260 cag ctg tgt ggt gct cga ttt tgt gat tcc tca ggc aac tcc ttt gct 930 Gln Leu Cys Gly Ala Arg Phe Cys Asp Ser Ser Gly Asn Ser Phe Ala 265 270 275 gtg act ggc tat gac ctt gcg gag atc cgg aga ttc agc agt gtg tga 978 Val Thr Gly Tyr Asp Leu Ala Glu Ile Arg Arg Phe Ser Ser Val * 280 285 290 gaagagcagc cttcctttgt ccctgtggta ttcataaagt acccgctccc cccaaaaaaa 1038 aaaa 1042 22 1111 DNA Homo sapiens CDS (97)..(1047) 22 gtccagtgtg gtggaattcc gtcgtctcta tggtggcggc ggatttggag ggaccctacg 60 aaccaggagt caggcgagcc gatctggggc tgcagg atg ttc cgc tgg gag cgc 114 Met Phe Arg Trp Glu Arg 1 5 tcc att ccc ctg cga ggc tcg gcc gcc gcc ctg tgc aac aac ctc agt 162 Ser Ile Pro Leu Arg Gly Ser Ala Ala Ala Leu Cys Asn Asn Leu Ser 10 15 20 gtg ctg cag ctg ccg gct cgc aac ctc acg tat ttt ggc gtg gtt cat 210 Val Leu Gln Leu Pro Ala Arg Asn Leu Thr Tyr Phe Gly Val Val His 25 30 35 gga cca agc gcc cag ctt ctc agc gct gct cct gag ggt gtg ccc ttg 258 Gly Pro Ser Ala Gln Leu Leu Ser Ala Ala Pro Glu Gly Val Pro Leu 40 45 50 gcc cag cgc cag ctc cac gct aag gag ggt gct gga gtg agt ccc cca 306 Ala Gln Arg Gln Leu His Ala Lys Glu Gly Ala Gly Val Ser Pro Pro 55 60 65 70 ctt atc act cag gtc cac tgg tgt gtc ctc ccc ttc cga gtg ctg ctg 354 Leu Ile Thr Gln Val His Trp Cys Val Leu Pro Phe Arg Val Leu Leu 75 80 85 gta ctc acc tca cat cga gga ata cag atg tac gag tcc aat ggc tac 402 Val Leu Thr Ser His Arg Gly Ile Gln Met Tyr Glu Ser Asn Gly Tyr 90 95 100 acc atg gtc tac tgg cat gca ctg gac tct gga gat gcc tcc cca gga 450 Thr Met Val Tyr Trp His Ala Leu Asp Ser Gly Asp Ala Ser Pro Gly 105 110 115 acg tgg tca ggc cgg gtg ctg gtg ttt gac atc cca gca aag ggt ccc 498 Thr Trp Ser Gly Arg Val Leu Val Phe Asp Ile Pro Ala Lys Gly Pro 120 125 130 aac att gta ctg agc gag gag ctg gct ggg cac cag atg cca atc aca 546 Asn Ile Val Leu Ser Glu Glu Leu Ala Gly His Gln Met Pro Ile Thr 135 140 145 150 gac att gcc acc gag cct gcc cag gga cag gat tgt gtg gct gac atg 594 Asp Ile Ala Thr Glu Pro Ala Gln Gly Gln Asp Cys Val Ala Asp Met 155 160 165 gtg acg gca gat gac tca ggc ttg ctg tgt gtc tgg cgg tca ggg cca 642 Val Thr Ala Asp Asp Ser Gly Leu Leu Cys Val Trp Arg Ser Gly Pro 170 175 180 gaa ttc aca tta ttg acc cgc att cca gga ttt gga gtt ccg tgc ccc 690 Glu Phe Thr Leu Leu Thr Arg Ile Pro Gly Phe Gly Val Pro Cys Pro 185 190 195 tct gtg cag ctg tgg cag ggg atc ata gca gca ggc tat ggg aac gga 738 Ser Val Gln Leu Trp Gln Gly Ile Ile Ala Ala Gly Tyr Gly Asn Gly 200 205 210 caa gtg cat cta tat gag gcc act aca gga aat cta cat gtc cag atc 786 Gln Val His Leu Tyr Glu Ala Thr Thr Gly Asn Leu His Val Gln Ile 215 220 225 230 aat gcc cat gcc cgg gcc atc tgc gcc ctg gac ctg gct ttt gag gtg 834 Asn Ala His Ala Arg Ala Ile Cys Ala Leu Asp Leu Ala Phe Glu Val 235 240 245 ggc aag cta ctc tct gca ggt gag gac acc ttt gtg cat atc tgg aag 882 Gly Lys Leu Leu Ser Ala Gly Glu Asp Thr Phe Val His Ile Trp Lys 250 255 260 ctg agc aga aac cca gag agt ggc tac att gag gtg gaa cac tgt cat 930 Leu Ser Arg Asn Pro Glu Ser Gly Tyr Ile Glu Val Glu His Cys His 265 270 275 ggt gag tgt gtc gcc gac acc cag ctg tgt ggt gct cga ttt tgt gat 978 Gly Glu Cys Val Ala Asp Thr Gln Leu Cys Gly Ala Arg Phe Cys Asp 280 285 290 tcc tca ggc aac tcc ttt gct gtg act ggc tat gac ctt gcg gag atc 1026 Ser Ser Gly Asn Ser Phe Ala Val Thr Gly Tyr Asp Leu Ala Glu Ile 295 300 305 310 cgg aga ttc agc agt gtg tga ga agagcagcct tcctttgtcc ctgtggtatt 1079 Arg Arg Phe Ser Ser Val * 315 cataaagtac ccgctccccc caaaaaaaaa aa 1111 23 1982 DNA Homo sapiens CDS (113)..(1054) 23 aatagtactc atcgctctgc ggaattcggc acgagatgct gccaacttct aaccgcaata 60 gtgactctgt gcttgtctgt ttagttctgt gtataaatgg aatgttgtgg ag atg 115 Met 1 acc cct ccc tgt gcc ggc tgg ttc ctc tcc ctt ttc ccc tgg tca cgg 163 Thr Pro Pro Cys Ala Gly Trp Phe Leu Ser Leu Phe Pro Trp Ser Arg 5 10 15 cta ctc atg gaa gca gga cca gta agg gac ctt cga tta aaa aaa aaa 211 Leu Leu Met Glu Ala Gly Pro Val Arg Asp Leu Arg Leu Lys Lys Lys 20 25 30 aag gcc gtg cgg ccc atc acg gag cgc agc ctc ctg cag ggg gac gag 259 Lys Ala Val Arg Pro Ile Thr Glu Arg Ser Leu Leu Gln Gly Asp Glu 35 40 45 att tgg aat gcc ctg aca gat aat tat ggg aat gtg atg cct gta gac 307 Ile Trp Asn Ala Leu Thr Asp Asn Tyr Gly Asn Val Met Pro Val Asp 50 55 60 65 tgg aag tca tcg cat act agg acc ttg cac ttg ctt act ctg aac ctc 355 Trp Lys Ser Ser His Thr Arg Thr Leu His Leu Leu Thr Leu Asn Leu 70 75 80 tca gaa aaa ggg gta agt gac agt ttg ctc ttt gat aca tca gat gat 403 Ser Glu Lys Gly Val Ser Asp Ser Leu Leu Phe Asp Thr Ser Asp Asp 85 90 95 gaa gag ctg aga gaa cag ctg gat atg cac tca atc atc gtc tcc tgt 451 Glu Glu Leu Arg Glu Gln Leu Asp Met His Ser Ile Ile Val Ser Cys 100 105 110 gtt aat gat gaa ccc ctc ttc acg gca gac cag gtt att gaa gaa att 499 Val Asn Asp Glu Pro Leu Phe Thr Ala Asp Gln Val Ile Glu Glu Ile 115 120 125 gaa gaa atg atg cag gaa tca ccg gac cca gaa gat gat gaa acc cct 547 Glu Glu Met Met Gln Glu Ser Pro Asp Pro Glu Asp Asp Glu Thr Pro 130 135 140 145 aca cag tca gat cgg ctt tca atg ctt tcc cag gaa att caa act ctc 595 Thr Gln Ser Asp Arg Leu Ser Met Leu Ser Gln Glu Ile Gln Thr Leu 150 155 160 aag agg tct agt acc ggc agt tat gaa gag aga gtg aaa agg ctc tca 643 Lys Arg Ser Ser Thr Gly Ser Tyr Glu Glu Arg Val Lys Arg Leu Ser 165 170 175 gtg tct gag tta aat gaa atc ctg gaa gaa att gag act gcc att aag 691 Val Ser Glu Leu Asn Glu Ile Leu Glu Glu Ile Glu Thr Ala Ile Lys 180 185 190 gag tac tct gag gag ctg gtg cag cag ttg gct tta cga gat gaa ctg 739 Glu Tyr Ser Glu Glu Leu Val Gln Gln Leu Ala Leu Arg Asp Glu Leu 195 200 205 gag ttt gaa aag gaa gtg aaa aac agc ttt att tct gtt ctt att gaa 787 Glu Phe Glu Lys Glu Val Lys Asn Ser Phe Ile Ser Val Leu Ile Glu 210 215 220 225 gtg caa aac aaa cag aaa gag cac aaa gaa aca gca aaa aag aaa aag 835 Val Gln Asn Lys Gln Lys Glu His Lys Glu Thr Ala Lys Lys Lys Lys 230 235 240 aaa cta aaa aat ggc agc tct cag aat ggg aag aat gag aga agt cat 883 Lys Leu Lys Asn Gly Ser Ser Gln Asn Gly Lys Asn Glu Arg Ser His 245 250 255 atg ccc ggc aca tat ttg act aca gtc att cct tat gag aaa aaa aac 931 Met Pro Gly Thr Tyr Leu Thr Thr Val Ile Pro Tyr Glu Lys Lys Asn 260 265 270 gga cca ccg tct gtt gaa gat ctt caa ata tta aca aaa att ctt cgt 979 Gly Pro Pro Ser Val Glu Asp Leu Gln Ile Leu Thr Lys Ile Leu Arg 275 280 285 gcc atg aag gag gac agt gaa aaa gtt ccg agc ttg tta act gat tat 1027 Ala Met Lys Glu Asp Ser Glu Lys Val Pro Ser Leu Leu Thr Asp Tyr 290 295 300 305 att ctg aaa gtt ctg tgt cct aca tag agcag caactttatc tgcggtgggc 1079 Ile Leu Lys Val Leu Cys Pro Thr * 310 tccaagctag atttccgaca gcattattct gagagctggc taccattacc cttcttgcta 1139 ttggaaactc agcacatttg aacttgggtt tgattcagta ttaacagatc ttgactacac 1199 taattcttta tattatagaa ccaacggaaa tatgggcact attttgaatt ctagagatgg 1259 tttttgttaa atctactaat aaactgttct cttagtagat taagagagag taatattaat 1319 tgtgcatgtg cagttgtatt tctcattaac tgacagtatg cccatttgtt tttatggctt 1379 tcttatctaa actgcactga tgaactagat taaagccttg ggagatttat actataaatt 1439 cagtgatggg caagaaccaa cactgttttt ttgtgaggaa ttgtcagtgt aactattacc 1499 taccagtatt gttcagagag attgaaacag gaataaacgg ggctgttctt ggaggaaggc 1559 aaaaccagaa tatggcatta ctttggttta atacttagtg ctaacattga aactgttggt 1619 ggtgatggat tttgtagctt gctgcttgtt tcaccactgg tcaaatttta accattaaat 1679 tgccattcac ttttagaatc ttgtatttaa gtaagttttg attttcaaat gttctgcttc 1739 atgtgtctgt gaagaattgt acttttttaa aagtgtgtgt cctctgaggt gcttgagaaa 1799 gtgtacactg cagaactgcc cattctcatt actgtgtcct attttattca tgcctgtgtg 1859 tttttcttaa gtatgaattc tagatacagc tacttatgga ttcatcaata tcatgagcac 1919 ttttgctggt tccagtcaaa tcaatggcat ttaataaatt ttttaagaag taaaaaaaaa 1979 aaa 1982 24 2063 DNA Homo sapiens CDS (113)..(1135) 24 aatagtactc atcgctctgc ggaattcggc acgagatgct gccaacttct aaccgcaata 60 gtgactctgt gcttgtctgt ttagttctgt gtataaatgg aatgttgtgg ag atg 115 Met 1 acc cct ccc tgt gcc ggc tgg ttc ctc tcc ctt ttc ccc tgg tca cgg 163 Thr Pro Pro Cys Ala Gly Trp Phe Leu Ser Leu Phe Pro Trp Ser Arg 5 10 15 cta ctc atg gaa gca gga cca gta agg gac ctt cga tta aaa aaa aaa 211 Leu Leu Met Glu Ala Gly Pro Val Arg Asp Leu Arg Leu Lys Lys Lys 20 25 30 aag gcc gtg cgg ccc atc acg gag cgc agc ctc ctg cag ggg gac gag 259 Lys Ala Val Arg Pro Ile Thr Glu Arg Ser Leu Leu Gln Gly Asp Glu 35 40 45 att tgg aat gcc ctg aca gat aat tat ggg aat gtg atg cct gta gac 307 Ile Trp Asn Ala Leu Thr Asp Asn Tyr Gly Asn Val Met Pro Val Asp 50 55 60 65 tgg aag tca tcg cat act agg acc ttg cac ttg ctt act ctg aac ctc 355 Trp Lys Ser Ser His Thr Arg Thr Leu His Leu Leu Thr Leu Asn Leu 70 75 80 tca gaa aaa ggg gta agt gac agt ttg ctc ttt gat aca tca gat gat 403 Ser Glu Lys Gly Val Ser Asp Ser Leu Leu Phe Asp Thr Ser Asp Asp 85 90 95 gaa gag ctg aga gaa cag ctg gat atg cac tca atc atc gtc tcc tgt 451 Glu Glu Leu Arg Glu Gln Leu Asp Met His Ser Ile Ile Val Ser Cys 100 105 110 gtt aat gat gaa ccc ctc ttc acg gca gac cag gtt att gaa gaa att 499 Val Asn Asp Glu Pro Leu Phe Thr Ala Asp Gln Val Ile Glu Glu Ile 115 120 125 gaa gaa atg atg cag gaa tca ccg gac cca gaa gat gat gaa acc cct 547 Glu Glu Met Met Gln Glu Ser Pro Asp Pro Glu Asp Asp Glu Thr Pro 130 135 140 145 aca cag tca gat cgg ctt tca atg ctt tcc cag gaa att caa act ctc 595 Thr Gln Ser Asp Arg Leu Ser Met Leu Ser Gln Glu Ile Gln Thr Leu 150 155 160 aag agg tct agt acc ggc agt tat gaa gag aga gtg aaa agg ctc tca 643 Lys Arg Ser Ser Thr Gly Ser Tyr Glu Glu Arg Val Lys Arg Leu Ser 165 170 175 gtg tct gag tta aat gaa atc ctg gaa gaa att gag act gcc att aag 691 Val Ser Glu Leu Asn Glu Ile Leu Glu Glu Ile Glu Thr Ala Ile Lys 180 185 190 gag tac tct gag gag ctg gtg cag cag ttg gct tta cga gat gaa ctg 739 Glu Tyr Ser Glu Glu Leu Val Gln Gln Leu Ala Leu Arg Asp Glu Leu 195 200 205 gag ttt gaa aag gaa gtg aaa aac agc ttt att tct gtt ctt att gaa 787 Glu Phe Glu Lys Glu Val Lys Asn Ser Phe Ile Ser Val Leu Ile Glu 210 215 220 225 gtg caa aac aaa cag aaa gag cac aaa gaa aca gca aaa aag aaa aag 835 Val Gln Asn Lys Gln Lys Glu His Lys Glu Thr Ala Lys Lys Lys Lys 230 235 240 aaa cta aaa aat ggc agc tct cag aat ggg aag aat gag aga agt cat 883 Lys Leu Lys Asn Gly Ser Ser Gln Asn Gly Lys Asn Glu Arg Ser His 245 250 255 atg ccc ggc aca cgc ttc agc atg gaa ggg atc tca aat gtc att cag 931 Met Pro Gly Thr Arg Phe Ser Met Glu Gly Ile Ser Asn Val Ile Gln 260 265 270 aat ggc ctc cgc cac acc ttt gga aat tca ggt gga gag aaa cag tat 979 Asn Gly Leu Arg His Thr Phe Gly Asn Ser Gly Gly Glu Lys Gln Tyr 275 280 285 ttg act aca gtc att cct tat gag aaa aaa aac gga cca ccg tct gtt 1027 Leu Thr Thr Val Ile Pro Tyr Glu Lys Lys Asn Gly Pro Pro Ser Val 290 295 300 305 gaa gat ctt caa ata tta aca aaa att ctt cgt gcc atg aag gag gac 1075 Glu Asp Leu Gln Ile Leu Thr Lys Ile Leu Arg Ala Met Lys Glu Asp 310 315 320 agt gaa aaa gtt ccg agc ttg tta act gat tat att ctg aaa gtt ctg 1123 Ser Glu Lys Val Pro Ser Leu Leu Thr Asp Tyr Ile Leu Lys Val Leu 325 330 335 tgt cct aca tag agc agcaacttta tctgcggtgg gctccaagct agatttccga 1178 Cys Pro Thr * 340 cagcattatt ctgagagctg gctaccatta cccttcttgc tattggaaac tcagcacatt 1238 tgaacttggg tttgattcag tattaacaga tcttgactac actaattctt tatattatag 1298 aaccaacgga aatatgggca ctattttgaa ttctagagat ggtttttgtt aaatctacta 1358 ataaactgtt ctcttagtag attaagagag agtaatatta attgtgcatg tgcagttgta 1418 tttctcatta actgacagta tgcccatttg tttttatggc tttcttatct aaactgcact 1478 gatgaactag attaaagcct tgggagattt atactataaa ttcagtgatg ggcaagaacc 1538 aacactgttt ttttgtgagg aattgtcagt gtaactatta cctaccagta ttgttcagag 1598 agattgaaac aggaataaac ggggctgttc ttggaggaag gcaaaaccag aatatggcat 1658 tactttggtt taatacttag tgctaacatt gaaactgttg gtggtgatgg attttgtagc 1718 ttgctgcttg tttcaccact ggtcaaattt taaccattaa attgccattc acttttagaa 1778 tcttgtattt aagtaagttt tgattttcaa atgttctgct tcatgtgtct gtgaagaatt 1838 gtactttttt aaaagtgtgt gtcctctgag gtgcttgaga aagtgtacac tgcagaactg 1898 cccattctca ttactgtgtc ctattttatt catgcctgtg tgtttttctt aagtatgaat 1958 tctagataca gctacttatg gattcatcaa tatcatgagc acttttgctg gttccagtca 2018 aatcaatggc atttaataaa ttttttaaga agtaaaaaaa aaaaa 2063 25 2129 DNA Homo sapiens CDS (198)..(1781) 25 aagctggtac gcctgcaggt accggtccgg aattcccggg tcgacgattt cgtgcaggaa 60 ccgcggctgc tggacaagag gggtgcggtg gatactgacc tttgctccgg cctcgtgtag 120 cgtgaagaca cagcgcatct ccccgctgta ggcttcctcc cacagaaccc gtttcgggcc 180 tcagagcgtc tggtgag atg ctg ttg ccg ctg ctg ctg ctg cta ccc atg 230 Met Leu Leu Pro Leu Leu Leu Leu Leu Pro Met 1 5 10 tgc tgg gcc gtg gag gtc aag agg ccc cgg ggc gtc tcc ctc acc aat 278 Cys Trp Ala Val Glu Val Lys Arg Pro Arg Gly Val Ser Leu Thr Asn 15 20 25 cat cac ttc tac gat gag tcc aag cct ttc acc tgc ctg gac ggt tcg 326 His His Phe Tyr Asp Glu Ser Lys Pro Phe Thr Cys Leu Asp Gly Ser 30 35 40 gcc acc atc cca ttt gat cag gtc aac gat gac tat tgc gac tgc aaa 374 Ala Thr Ile Pro Phe Asp Gln Val Asn Asp Asp Tyr Cys Asp Cys Lys 45 50 55 gat ggc tct gac gag cca ggc acg gct gcc tgt cct aat ggc agc ttc 422 Asp Gly Ser Asp Glu Pro Gly Thr Ala Ala Cys Pro Asn Gly Ser Phe 60 65 70 75 cac tgc acc aac act ggc tat aag ccc ctg tat atc ccc tcc aac cgg 470 His Cys Thr Asn Thr Gly Tyr Lys Pro Leu Tyr Ile Pro Ser Asn Arg 80 85 90 gtc aac gat ggt gtt tgt gac tgc tgc gat gga aca gac gag tac aac 518 Val Asn Asp Gly Val Cys Asp Cys Cys Asp Gly Thr Asp Glu Tyr Asn 95 100 105 agc ggc gtc atc tgt gag aac acc tgc aaa gag aag ggc cgt aag gag 566 Ser Gly Val Ile Cys Glu Asn Thr Cys Lys Glu Lys Gly Arg Lys Glu 110 115 120 aga gag tcc ctg cag cag atg gcc gag gtc acc cgc gaa ggg ttc cgt 614 Arg Glu Ser Leu Gln Gln Met Ala Glu Val Thr Arg Glu Gly Phe Arg 125 130 135 ctg aag aag atc ctt att gag gac tgg aag aag gca cgg gag gag aag 662 Leu Lys Lys Ile Leu Ile Glu Asp Trp Lys Lys Ala Arg Glu Glu Lys 140 145 150 155 cag aaa aag ctc att gag cta cag gct ggg aag aag tct ctg gaa gac 710 Gln Lys Lys Leu Ile Glu Leu Gln Ala Gly Lys Lys Ser Leu Glu Asp 160 165 170 cag gtg gag atg ctg cgg aca gtg aag gag gaa gct gag aag cca gag 758 Gln Val Glu Met Leu Arg Thr Val Lys Glu Glu Ala Glu Lys Pro Glu 175 180 185 aga gag gcc aaa gag cag cac cag aag ctg tgg gaa gag cag ctg gct 806 Arg Glu Ala Lys Glu Gln His Gln Lys Leu Trp Glu Glu Gln Leu Ala 190 195 200 gct gcc aag gcc caa cag gag cag gag ctg gcg gct gat gcc ttc aag 854 Ala Ala Lys Ala Gln Gln Glu Gln Glu Leu Ala Ala Asp Ala Phe Lys 205 210 215 gag ctg gat gat gac atg gac ggg acg gtc tcg gtg act gag ctg cag 902 Glu Leu Asp Asp Asp Met Asp Gly Thr Val Ser Val Thr Glu Leu Gln 220 225 230 235 act cac ccg gag ctg gac aca gat ggg gat ggg gcg ttg tca gaa gcg 950 Thr His Pro Glu Leu Asp Thr Asp Gly Asp Gly Ala Leu Ser Glu Ala 240 245 250 gaa gct cag gcc ctc ctc agt ggg gac aca cag aca gac gcc acc tct 998 Glu Ala Gln Ala Leu Leu Ser Gly Asp Thr Gln Thr Asp Ala Thr Ser 255 260 265 ttc tac gac cgc gtc tgg gcc gcc atc agg gac aag tac cgg tcc gag 1046 Phe Tyr Asp Arg Val Trp Ala Ala Ile Arg Asp Lys Tyr Arg Ser Glu 270 275 280 gca ctg ccc acc gac ctt cca gca cct tct gcc cct gac ttg acg gag 1094 Ala Leu Pro Thr Asp Leu Pro Ala Pro Ser Ala Pro Asp Leu Thr Glu 285 290 295 ccc aag gag gag cag ccg cca gtg ccc tcg tcg ccc aca gag gag gag 1142 Pro Lys Glu Glu Gln Pro Pro Val Pro Ser Ser Pro Thr Glu Glu Glu 300 305 310 315 gag gag gag gag gag gag gaa gaa gag gct gaa gaa gag gag gag gag 1190 Glu Glu Glu Glu Glu Glu Glu Glu Glu Ala Glu Glu Glu Glu Glu Glu 320 325 330 gag gat tcc gag gag gcc cca ccg cca ctg tca ccc ccg cag ccg gcc 1238 Glu Asp Ser Glu Glu Ala Pro Pro Pro Leu Ser Pro Pro Gln Pro Ala 335 340 345 agc cct gct gag gaa gac aaa atg ccg ccc tac gac gag cag acg cag 1286 Ser Pro Ala Glu Glu Asp Lys Met Pro Pro Tyr Asp Glu Gln Thr Gln 350 355 360 gcc ttc atc gat gct gcc cag gag gcc cgc aac aag ttc gag gag gcc 1334 Ala Phe Ile Asp Ala Ala Gln Glu Ala Arg Asn Lys Phe Glu Glu Ala 365 370 375 gag cgg tcg ctg aag gac atg gag gag tcc atc agg aac ctg gag caa 1382 Glu Arg Ser Leu Lys Asp Met Glu Glu Ser Ile Arg Asn Leu Glu Gln 380 385 390 395 gag att tct ttt gac ttt ggc ccc aac ggg gag ttt gct tac ctg tac 1430 Glu Ile Ser Phe Asp Phe Gly Pro Asn Gly Glu Phe Ala Tyr Leu Tyr 400 405 410 agc cag tgc tac gag ctc acc acc aac gaa tac gtc tac cgc ctc tgc 1478 Ser Gln Cys Tyr Glu Leu Thr Thr Asn Glu Tyr Val Tyr Arg Leu Cys 415 420 425 ccc ttc aag ctt gtc tcg cag aaa ccc aaa ctc ggg ggc tct ccc acc 1526 Pro Phe Lys Leu Val Ser Gln Lys Pro Lys Leu Gly Gly Ser Pro Thr 430 435 440 agc ctt ggc acc tgg ggc tca tgg att ggc ccc gac cac gac aag ttc 1574 Ser Leu Gly Thr Trp Gly Ser Trp Ile Gly Pro Asp His Asp Lys Phe 445 450 455 agt gcc atg aag tat gag caa ggc acg ggc tgc tgg cag ggc ccc aac 1622 Ser Ala Met Lys Tyr Glu Gln Gly Thr Gly Cys Trp Gln Gly Pro Asn 460 465 470 475 cgc tcc acc acc gtg cgc ctc ctg tgc ggg aaa gag acc atg gtg acc 1670 Arg Ser Thr Thr Val Arg Leu Leu Cys Gly Lys Glu Thr Met Val Thr 480 485 490 agc acc aca gag ccc agt cgc tgc gag tac ctc atg gag ctg atg acg 1718 Ser Thr Thr Glu Pro Ser Arg Cys Glu Tyr Leu Met Glu Leu Met Thr 495 500 505 cca gcc gcc tgc ccg gag cca ccg cct gaa gca ccc acc gaa gac gac 1766 Pro Ala Ala Cys Pro Glu Pro Pro Pro Glu Ala Pro Thr Glu Asp Asp 510 515 520 cat gac gag ctc tag ctggatgggc gcagagaacc ttcaagaagg catgaagcca 1821 His Asp Glu Leu * 525 gcccctgcag tgccgtccac ccgcccctct gggcctgcct gtggctctgt tgccctcctc 1881 tgtggcggca ggacctttgt ggggcttcgt gccctgctct ggggcccagg cggggctggt 1941 ccacattccc aggccccaac agccttcaaa gatgggtaaa ggagcttgcc ctccctgggc 2001 cccccacctt ggtgactcgc cccaccaccc ccagccctgt ccctgccacc cctcctagtg 2061 gggactagtg aatgacttga cctgtgacct caatacaata aatgtgatcc cccaccccaa 2121 aaaaaaaa 2129 26 2120 DNA Homo sapiens CDS (198)..(1772) 26 aagctggtac gcctgcaggt accggtccgg aattcccggg tcgacgattt cgtgcaggaa 60 ccgcggctgc tggacaagag gggtgcggtg gatactgacc tttgctccgg cctcgtgtag 120 cgtgaagaca cagcgcatct ccccgctgta ggcttcctcc cacagaaccc gtttcgggcc 180 tcagagcgtc tggtgag atg ctg ttg ccg ctg ctg ctg ctg cta ccc atg 230 Met Leu Leu Pro Leu Leu Leu Leu Leu Pro Met 1 5 10 tgc tgg gcc gtg gag gtc aag agg ccc cgg ggc gtc tcc ctc acc aat 278 Cys Trp Ala Val Glu Val Lys Arg Pro Arg Gly Val Ser Leu Thr Asn 15 20 25 cat cac ttc tac gat gag tcc aag cct ttc acc tgc ctg gac ggt tcg 326 His His Phe Tyr Asp Glu Ser Lys Pro Phe Thr Cys Leu Asp Gly Ser 30 35 40 gcc acc atc cca ttt gat cag gtc aac gat gac tat tgc gac tgc aaa 374 Ala Thr Ile Pro Phe Asp Gln Val Asn Asp Asp Tyr Cys Asp Cys Lys 45 50 55 gat ggc tct gac gag cca ggc acg gct gcc tgt cct aat ggc agc ttc 422 Asp Gly Ser Asp Glu Pro Gly Thr Ala Ala Cys Pro Asn Gly Ser Phe 60 65 70 75 cac tgc acc aac act ggc tat aag ccc ctg tat atc ccc tcc aac cgg 470 His Cys Thr Asn Thr Gly Tyr Lys Pro Leu Tyr Ile Pro Ser Asn Arg 80 85 90 gtc aac gat ggt gtt tgt gac tgc tgc gat gga aca gac gag tac aac 518 Val Asn Asp Gly Val Cys Asp Cys Cys Asp Gly Thr Asp Glu Tyr Asn 95 100 105 agc ggc gtc atc tgt gag aac acc tgc aaa gag aag ggc cgt aag gag 566 Ser Gly Val Ile Cys Glu Asn Thr Cys Lys Glu Lys Gly Arg Lys Glu 110 115 120 aga gag tcc ctg cag cag atg gcc gag gtc acc cgc gaa ggg ttc cgt 614 Arg Glu Ser Leu Gln Gln Met Ala Glu Val Thr Arg Glu Gly Phe Arg 125 130 135 ctg aag aag atc ctt att gag gac tgg aag aag gca cgg gag gag aag 662 Leu Lys Lys Ile Leu Ile Glu Asp Trp Lys Lys Ala Arg Glu Glu Lys 140 145 150 155 cag aaa aag ctc att gag cta cag gct ggg aag aag tct ctg gaa gac 710 Gln Lys Lys Leu Ile Glu Leu Gln Ala Gly Lys Lys Ser Leu Glu Asp 160 165 170 cag gtg gag atg ctg cgg aca gtg aag gag gaa gct gag aag cca gag 758 Gln Val Glu Met Leu Arg Thr Val Lys Glu Glu Ala Glu Lys Pro Glu 175 180 185 aga gag gcc aaa gag cag cac cag aag ctg tgg gaa gag cag ctg gct 806 Arg Glu Ala Lys Glu Gln His Gln Lys Leu Trp Glu Glu Gln Leu Ala 190 195 200 gct gcc aag gcc caa cag gag cag gag ctg gcg gct gat gcc ttc aag 854 Ala Ala Lys Ala Gln Gln Glu Gln Glu Leu Ala Ala Asp Ala Phe Lys 205 210 215 gag ctg gat gat gac atg gac ggg acg gtc tcg gtg act gag ctg cag 902 Glu Leu Asp Asp Asp Met Asp Gly Thr Val Ser Val Thr Glu Leu Gln 220 225 230 235 act cac ccg gag ctg gac aca gat ggg gat ggg gcg ttg tca gaa gcg 950 Thr His Pro Glu Leu Asp Thr Asp Gly Asp Gly Ala Leu Ser Glu Ala 240 245 250 gaa gct cag gcc ctc ctc agt ggg gac aca cag aca gac gcc acc tct 998 Glu Ala Gln Ala Leu Leu Ser Gly Asp Thr Gln Thr Asp Ala Thr Ser 255 260 265 ttc tac gac cgc gtc tgg gcc gcc atc agg gac aag tac cgg tcc gag 1046 Phe Tyr Asp Arg Val Trp Ala Ala Ile Arg Asp Lys Tyr Arg Ser Glu 270 275 280 gca ctg ccc acc gac ctt cca gca cct tct gcc cct gac ttg acg gag 1094 Ala Leu Pro Thr Asp Leu Pro Ala Pro Ser Ala Pro Asp Leu Thr Glu 285 290 295 ccc aag gag gag cag ccg cca gtg ccc tcg tcg ccc aca gag gag gag 1142 Pro Lys Glu Glu Gln Pro Pro Val Pro Ser Ser Pro Thr Glu Glu Glu 300 305 310 315 gag gag gag gag gag gag gaa gaa gag gct gaa gaa gag gag gag gag 1190 Glu Glu Glu Glu Glu Glu Glu Glu Glu Ala Glu Glu Glu Glu Glu Glu 320 325 330 gag gat tcc gag gtg cag ggg gag cag ccc aag ccg gcc agc cct gct 1238 Glu Asp Ser Glu Val Gln Gly Glu Gln Pro Lys Pro Ala Ser Pro Ala 335 340 345 gag gaa gac aaa atg ccg ccc tac gac gag cag acg cag gcc ttc atc 1286 Glu Glu Asp Lys Met Pro Pro Tyr Asp Glu Gln Thr Gln Ala Phe Ile 350 355 360 gat gct gcc cag gag gcc cgc aac aag ttc gag gag gcc gag cgg tcg 1334 Asp Ala Ala Gln Glu Ala Arg Asn Lys Phe Glu Glu Ala Glu Arg Ser 365 370 375 ctg aag gac atg gag gag tcc atc agg aac ctg gag caa gag att tct 1382 Leu Lys Asp Met Glu Glu Ser Ile Arg Asn Leu Glu Gln Glu Ile Ser 380 385 390 395 ttt gac ttt ggc ccc aac ggg gag ttt gct tac ctg tac agc cag tgc 1430 Phe Asp Phe Gly Pro Asn Gly Glu Phe Ala Tyr Leu Tyr Ser Gln Cys 400 405 410 tac gag ctc acc acc aac gaa tac gtc tac cgc ctc tgc ccc ttc aag 1478 Tyr Glu Leu Thr Thr Asn Glu Tyr Val Tyr Arg Leu Cys Pro Phe Lys 415 420 425 ctt gtc tcg cag aaa ccc aaa ctc ggg ggc tct ccc acc agc ctt ggc 1526 Leu Val Ser Gln Lys Pro Lys Leu Gly Gly Ser Pro Thr Ser Leu Gly 430 435 440 acc tgg ggc tca tgg att ggc ccc gac cac gac aag ttc agt gcc atg 1574 Thr Trp Gly Ser Trp Ile Gly Pro Asp His Asp Lys Phe Ser Ala Met 445 450 455 aag tat gag caa ggc acg ggc tgc tgg cag ggc ccc aac cgc tcc acc 1622 Lys Tyr Glu Gln Gly Thr Gly Cys Trp Gln Gly Pro Asn Arg Ser Thr 460 465 470 475 acc gtg cgc ctc ctg tgc ggg aaa gag acc atg gtg acc agc acc aca 1670 Thr Val Arg Leu Leu Cys Gly Lys Glu Thr Met Val Thr Ser Thr Thr 480 485 490 gag ccc agt cgc tgc gag tac ctc atg gag ctg atg acg cca gcc gcc 1718 Glu Pro Ser Arg Cys Glu Tyr Leu Met Glu Leu Met Thr Pro Ala Ala 495 500 505 tgc ccg gag cca ccg cct gaa gca ccc acc gaa gac gac cat gac gag 1766 Cys Pro Glu Pro Pro Pro Glu Ala Pro Thr Glu Asp Asp His Asp Glu 510 515 520 ctc tag ctggatgggc gcagagaacc ttcaagaagg catgaagcca gcccctgcag 1822 Leu * tgccgtccac ccgcccctct gggcctgcct gtggctctgt tgccctcctc tgtggcggca 1882 ggacctttgt ggggcttcgt gccctgctct ggggcccagg cggggctggt ccacattccc 1942 aggccccaac agccttcaaa gatgggtaaa ggagcttgcc ctccctgggc cccccacctt 2002 ggtgactcgc cccaccaccc ccagccctgt ccctgccacc cctcctagtg gggactagtg 2062 aatgacttga cctgtgacct caatacaata aatgtgatcc cccaccccaa aaaaaaaa 2120 27 2150 DNA Homo sapiens CDS (198)..(1802) 27 aagctggtac gcctgcaggt accggtccgg aattcccggg tcgacgattt cgtgcaggaa 60 ccgcggctgc tggacaagag gggtgcggtg gatactgacc tttgctccgg cctcgtgtag 120 cgtgaagaca cagcgcatct ccccgctgta ggcttcctcc cacagaaccc gtttcgggcc 180 tcagagcgtc tggtgag atg ctg ttg ccg ctg ctg ctg ctg cta ccc atg 230 Met Leu Leu Pro Leu Leu Leu Leu Leu Pro Met 1 5 10 tgc tgg gcc gtg gag gtc aag agg ccc cgg ggc gtc tcc ctc acc aat 278 Cys Trp Ala Val Glu Val Lys Arg Pro Arg Gly Val Ser Leu Thr Asn 15 20 25 cat cac ttc tac gat gag tcc aag cct ttc acc tgc ctg gac ggt tcg 326 His His Phe Tyr Asp Glu Ser Lys Pro Phe Thr Cys Leu Asp Gly Ser 30 35 40 gcc acc atc cca ttt gat cag gtc aac gat gac tat tgc gac tgc aaa 374 Ala Thr Ile Pro Phe Asp Gln Val Asn Asp Asp Tyr Cys Asp Cys Lys 45 50 55 gat ggc tct gac gag cca ggc acg gct gcc tgt cct aat ggc agc ttc 422 Asp Gly Ser Asp Glu Pro Gly Thr Ala Ala Cys Pro Asn Gly Ser Phe 60 65 70 75 cac tgc acc aac act ggc tat aag ccc ctg tat atc ccc tcc aac cgg 470 His Cys Thr Asn Thr Gly Tyr Lys Pro Leu Tyr Ile Pro Ser Asn Arg 80 85 90 gtc aac gat ggt gtt tgt gac tgc tgc gat gga aca gac gag tac aac 518 Val Asn Asp Gly Val Cys Asp Cys Cys Asp Gly Thr Asp Glu Tyr Asn 95 100 105 agc ggc gtc atc tgt gag aac acc tgc aaa gag aag ggc cgt aag gag 566 Ser Gly Val Ile Cys Glu Asn Thr Cys Lys Glu Lys Gly Arg Lys Glu 110 115 120 aga gag tcc ctg cag cag atg gcc gag gtc acc cgc gaa ggg ttc cgt 614 Arg Glu Ser Leu Gln Gln Met Ala Glu Val Thr Arg Glu Gly Phe Arg 125 130 135 ctg aag aag atc ctt att gag gac tgg aag aag gca cgg gag gag aag 662 Leu Lys Lys Ile Leu Ile Glu Asp Trp Lys Lys Ala Arg Glu Glu Lys 140 145 150 155 cag aaa aag ctc att gag cta cag gct ggg aag aag tct ctg gaa gac 710 Gln Lys Lys Leu Ile Glu Leu Gln Ala Gly Lys Lys Ser Leu Glu Asp 160 165 170 cag gtg gag atg ctg cgg aca gtg aag gag gaa gct gag aag cca gag 758 Gln Val Glu Met Leu Arg Thr Val Lys Glu Glu Ala Glu Lys Pro Glu 175 180 185 aga gag gcc aaa gag cag cac cag aag ctg tgg gaa gag cag ctg gct 806 Arg Glu Ala Lys Glu Gln His Gln Lys Leu Trp Glu Glu Gln Leu Ala 190 195 200 gct gcc aag gcc caa cag gag cag gag ctg gcg gct gat gcc ttc aag 854 Ala Ala Lys Ala Gln Gln Glu Gln Glu Leu Ala Ala Asp Ala Phe Lys 205 210 215 gag ctg gat gat gac atg gac ggg acg gtc tcg gtg act gag ctg cag 902 Glu Leu Asp Asp Asp Met Asp Gly Thr Val Ser Val Thr Glu Leu Gln 220 225 230 235 act cac ccg gag ctg gac aca gat ggg gat ggg gcg ttg tca gaa gcg 950 Thr His Pro Glu Leu Asp Thr Asp Gly Asp Gly Ala Leu Ser Glu Ala 240 245 250 gaa gct cag gcc ctc ctc agt ggg gac aca cag aca gac gcc acc tct 998 Glu Ala Gln Ala Leu Leu Ser Gly Asp Thr Gln Thr Asp Ala Thr Ser 255 260 265 ttc tac gac cgc gtc tgg gcc gcc atc agg gac aag tac cgg tcc gag 1046 Phe Tyr Asp Arg Val Trp Ala Ala Ile Arg Asp Lys Tyr Arg Ser Glu 270 275 280 gca ctg ccc acc gac ctt cca gca cct tct gcc cct gac ttg acg gag 1094 Ala Leu Pro Thr Asp Leu Pro Ala Pro Ser Ala Pro Asp Leu Thr Glu 285 290 295 ccc aag gag gag cag ccg cca gtg ccc tcg tcg ccc aca gag gag gag 1142 Pro Lys Glu Glu Gln Pro Pro Val Pro Ser Ser Pro Thr Glu Glu Glu 300 305 310 315 gag gag gag gag gag gag gaa gaa gag gct gaa gaa gag gag gag gag 1190 Glu Glu Glu Glu Glu Glu Glu Glu Glu Ala Glu Glu Glu Glu Glu Glu 320 325 330 gag gat tcc gag gtg cag ggg gag cag ccc aag gag gcc cca ccg cca 1238 Glu Asp Ser Glu Val Gln Gly Glu Gln Pro Lys Glu Ala Pro Pro Pro 335 340 345 ctg tca ccc ccg cag ccg gcc agc cct gct gag gaa gac aaa atg ccg 1286 Leu Ser Pro Pro Gln Pro Ala Ser Pro Ala Glu Glu Asp Lys Met Pro 350 355 360 ccc tac gac gag cag acg cag gcc ttc atc gat gct gcc cag gag gcc 1334 Pro Tyr Asp Glu Gln Thr Gln Ala Phe Ile Asp Ala Ala Gln Glu Ala 365 370 375 cgc aac aag ttc gag gag gcc gag cgg tcg ctg aag gac atg gag gag 1382 Arg Asn Lys Phe Glu Glu Ala Glu Arg Ser Leu Lys Asp Met Glu Glu 380 385 390 395 tcc atc agg aac ctg gag caa gag att tct ttt gac ttt ggc ccc aac 1430 Ser Ile Arg Asn Leu Glu Gln Glu Ile Ser Phe Asp Phe Gly Pro Asn 400 405 410 ggg gag ttt gct tac ctg tac agc cag tgc tac gag ctc acc acc aac 1478 Gly Glu Phe Ala Tyr Leu Tyr Ser Gln Cys Tyr Glu Leu Thr Thr Asn 415 420 425 gaa tac gtc tac cgc ctc tgc ccc ttc aag ctt gtc tcg cag aaa ccc 1526 Glu Tyr Val Tyr Arg Leu Cys Pro Phe Lys Leu Val Ser Gln Lys Pro 430 435 440 aaa ctc ggg ggc tct ccc acc agc ctt ggc acc tgg ggc tca tgg att 1574 Lys Leu Gly Gly Ser Pro Thr Ser Leu Gly Thr Trp Gly Ser Trp Ile 445 450 455 ggc ccc gac cac gac aag ttc agt gcc atg aag tat gag caa ggc acg 1622 Gly Pro Asp His Asp Lys Phe Ser Ala Met Lys Tyr Glu Gln Gly Thr 460 465 470 475 ggc tgc tgg cag ggc ccc aac cgc tcc acc acc gtg cgc ctc ctg tgc 1670 Gly Cys Trp Gln Gly Pro Asn Arg Ser Thr Thr Val Arg Leu Leu Cys 480 485 490 ggg aaa gag acc atg gtg acc agc acc aca gag ccc agt cgc tgc gag 1718 Gly Lys Glu Thr Met Val Thr Ser Thr Thr Glu Pro Ser Arg Cys Glu 495 500 505 tac ctc atg gag ctg atg acg cca gcc gcc tgc ccg gag cca ccg cct 1766 Tyr Leu Met Glu Leu Met Thr Pro Ala Ala Cys Pro Glu Pro Pro Pro 510 515 520 gaa gca ccc acc gaa gac gac cat gac gag ctc tag ctgg atgggcgcag 1816 Glu Ala Pro Thr Glu Asp Asp His Asp Glu Leu * 525 530 agaaccttca agaaggcatg aagccagccc ctgcagtgcc gtccacccgc ccctctgggc 1876 ctgcctgtgg ctctgttgcc ctcctctgtg gcggcaggac ctttgtgggg cttcgtgccc 1936 tgctctgggg cccaggcggg gctggtccac attcccaggc cccaacagcc ttcaaagatg 1996 ggtaaaggag cttgccctcc ctgggccccc caccttggtg actcgcccca ccacccccag 2056 ccctgtccct gccacccctc ctagtgggga ctagtgaatg acttgacctg tgacctcaat 2116 acaataaatg tgatccccca ccccaaaaaa aaaa 2150 28 1411 DNA Homo sapiens CDS (96)..(1256) 28 catcgatcgc attcggtacc ggtccggaat tcccgggtcg acccacgcgt ccgggtgtgg 60 ctgtgccgtt ggtcctgtgc ggtcacttag ccaag atg cct gag gaa acc cag 113 Met Pro Glu Glu Thr Gln 1 5 acc caa gac caa ccg atg gag gag gag gag gtt gag acg ttc gcc ttt 161 Thr Gln Asp Gln Pro Met Glu Glu Glu Glu Val Glu Thr Phe Ala Phe 10 15 20 cag gca gaa att gcc cag ttg atg tca ttg atc atc aat act ttc tac 209 Gln Ala Glu Ile Ala Gln Leu Met Ser Leu Ile Ile Asn Thr Phe Tyr 25 30 35 tcg aac aaa gag atc ttt ctg aga gag ctc att tca aat tca tca gat 257 Ser Asn Lys Glu Ile Phe Leu Arg Glu Leu Ile Ser Asn Ser Ser Asp 40 45 50 gca ttg gac aaa atc cgg tat gaa agc ttg aca gat ccc agt aaa tta 305 Ala Leu Asp Lys Ile Arg Tyr Glu Ser Leu Thr Asp Pro Ser Lys Leu 55 60 65 70 gac tct ggg aaa gag ctg cat att aac ctt ata ccg aac aaa caa gat 353 Asp Ser Gly Lys Glu Leu His Ile Asn Leu Ile Pro Asn Lys Gln Asp 75 80 85 cga act ctc act att gtg gat act gga att gga atg acc aag gct gac 401 Arg Thr Leu Thr Ile Val Asp Thr Gly Ile Gly Met Thr Lys Ala Asp 90 95 100 ttg atc aat aac ctt ggt act atc gcc aag tct ggg acc aaa gcg ttc 449 Leu Ile Asn Asn Leu Gly Thr Ile Ala Lys Ser Gly Thr Lys Ala Phe 105 110 115 atg gaa gct ttg cag gct ggt gca gat atc tct atg att ggc cag ttc 497 Met Glu Ala Leu Gln Ala Gly Ala Asp Ile Ser Met Ile Gly Gln Phe 120 125 130 ggt gtt ggt ttt tat tct gct tat ttg gtt gct gag aaa gta act gtg 545 Gly Val Gly Phe Tyr Ser Ala Tyr Leu Val Ala Glu Lys Val Thr Val 135 140 145 150 atc acc aaa cat aac gat gat gag cag tac gct tgg gag tcc tca gca 593 Ile Thr Lys His Asn Asp Asp Glu Gln Tyr Ala Trp Glu Ser Ser Ala 155 160 165 ggg gga tca ttc aca gtg agg aca gac aca ggt gaa cct atg ggt cgt 641 Gly Gly Ser Phe Thr Val Arg Thr Asp Thr Gly Glu Pro Met Gly Arg 170 175 180 gga aca aaa gtt atc cta cac ctg aaa gaa gac caa act gag tac ttg 689 Gly Thr Lys Val Ile Leu His Leu Lys Glu Asp Gln Thr Glu Tyr Leu 185 190 195 gag gaa cga aga ata aag gag att gtg aag aaa cat tct cag ttt att 737 Glu Glu Arg Arg Ile Lys Glu Ile Val Lys Lys His Ser Gln Phe Ile 200 205 210 gga tat ccc att act ctt ttt gtg gag aag gaa cgt gat aaa gaa gta 785 Gly Tyr Pro Ile Thr Leu Phe Val Glu Lys Glu Arg Asp Lys Glu Val 215 220 225 230 agc gat gat gag gct gaa gaa aag gaa gac aaa gaa gaa gaa aaa aaa 833 Ser Asp Asp Glu Ala Glu Glu Lys Glu Asp Lys Glu Glu Glu Lys Lys 235 240 245 aaa gaa gag aaa gag tcg gaa gac aaa cct gaa att gaa gat gtt ggt 881 Lys Glu Glu Lys Glu Ser Glu Asp Lys Pro Glu Ile Glu Asp Val Gly 250 255 260 tct gat gag gaa gaa gaa aag aag gat ggt gac aag aag aag aag aag 929 Ser Asp Glu Glu Glu Glu Lys Lys Asp Gly Asp Lys Lys Lys Lys Lys 265 270 275 aag att aag gaa aag tac atc gat caa gaa gag ctc aac aaa aca aag 977 Lys Ile Lys Glu Lys Tyr Ile Asp Gln Glu Glu Leu Asn Lys Thr Lys 280 285 290 ccc atc tgg acc aga aat ccc gac gat att act aat gag gag tac gga 1025 Pro Ile Trp Thr Arg Asn Pro Asp Asp Ile Thr Asn Glu Glu Tyr Gly 295 300 305 310 gaa ttc tat aag agc ttg acc aat gac tgg gaa gat cac ttg gca gtg 1073 Glu Phe Tyr Lys Ser Leu Thr Asn Asp Trp Glu Asp His Leu Ala Val 315 320 325 aag cat ttt tca gtt gaa gga cag ttg gaa ttc aga gcc ctt cta ttt 1121 Lys His Phe Ser Val Glu Gly Gln Leu Glu Phe Arg Ala Leu Leu Phe 330 335 340 gtc cca cga cgt gct cct ttt gat ctg ttt gaa aac aga aag aaa aag 1169 Val Pro Arg Arg Ala Pro Phe Asp Leu Phe Glu Asn Arg Lys Lys Lys 345 350 355 aac aat atc aaa ttg tat gta cgc aga gtt ttc atc atg gat aac tgt 1217 Asn Asn Ile Lys Leu Tyr Val Arg Arg Val Phe Ile Met Asp Asn Cys 360 365 370 gag gag cta atc cct gaa tat ctg agt aag tat aga tag gaaaaataat 1266 Glu Glu Leu Ile Pro Glu Tyr Leu Ser Lys Tyr Arg * 375 380 385 cactgtcact gattaaagaa gtactttctg ggtgggcatg gtggctcaca cctataatcc 1326 tagcactttg ggaggccggg gtgggcagat cacttgaggt caggagttca agaccagcct 1386 gggcaacatg gtgaaacccc atttt 1411 29 1280 DNA Homo sapiens CDS (138)..(965) 29 cgagttagat agcgtggcaa gctgcctagc gtttaaactt aagcttggta ccgagctcgg 60 atccactagt ccagtgtggt ggaattcctc acgttgcggt cttactcacg ttgcggcctt 120 cctcgcgtcc ggccggg atg aag ccg atc cta ctg cag ggc cat gag cgg 170 Met Lys Pro Ile Leu Leu Gln Gly His Glu Arg 1 5 10 tcc att acg cag att aag tat aac cgc gaa gga gac ctc ctc ttt act 218 Ser Ile Thr Gln Ile Lys Tyr Asn Arg Glu Gly Asp Leu Leu Phe Thr 15 20 25 gtg gcc aag gac cct atc gtc aat gta tgg tac tct gtg aat ggt gag 266 Val Ala Lys Asp Pro Ile Val Asn Val Trp Tyr Ser Val Asn Gly Glu 30 35 40 agg ctg ggc acc tac atg ggc cat acc gga gct gtg tgg tgt gtg gac 314 Arg Leu Gly Thr Tyr Met Gly His Thr Gly Ala Val Trp Cys Val Asp 45 50 55 gct gac tgg gac acc aag cat gtc ctc act ggc tca gct gac aac agc 362 Ala Asp Trp Asp Thr Lys His Val Leu Thr Gly Ser Ala Asp Asn Ser 60 65 70 75 tgt cgt ctc tgg gac tgt gaa aca gac aac aat gag ccc tac atg aag 410 Cys Arg Leu Trp Asp Cys Glu Thr Asp Asn Asn Glu Pro Tyr Met Lys 80 85 90 atc cct tgc aat gac tct aaa atc acc agt gct gtt tgg gga ccc ctg 458 Ile Pro Cys Asn Asp Ser Lys Ile Thr Ser Ala Val Trp Gly Pro Leu 95 100 105 ggg gag tgc atc atc gct ggc cat gag agt gga gag ctc aac cag tat 506 Gly Glu Cys Ile Ile Ala Gly His Glu Ser Gly Glu Leu Asn Gln Tyr 110 115 120 agt gcc aag tct gga gag gtg ttg gtg aat gtt aag gag cac tcc cgg 554 Ser Ala Lys Ser Gly Glu Val Leu Val Asn Val Lys Glu His Ser Arg 125 130 135 cag atc aac gac atc cag tta tcc agg gac atg acc atg ttt gtg acc 602 Gln Ile Asn Asp Ile Gln Leu Ser Arg Asp Met Thr Met Phe Val Thr 140 145 150 155 gcg tcc aag gac aac aca gcc aag ctt ttt gac tcc aca act ctt gaa 650 Ala Ser Lys Asp Asn Thr Ala Lys Leu Phe Asp Ser Thr Thr Leu Glu 160 165 170 cat cag aag act ttc cgg aca gaa cgt cct gtc aac tca gct gcc ctc 698 His Gln Lys Thr Phe Arg Thr Glu Arg Pro Val Asn Ser Ala Ala Leu 175 180 185 tcc ccc aac tat gac cat gtg gtc ctg ggc ggt ggt cag gaa gcc atg 746 Ser Pro Asn Tyr Asp His Val Val Leu Gly Gly Gly Gln Glu Ala Met 190 195 200 gat gta acc aca acc tcc acc agg att ggc aag ttt gag gcc agg ttc 794 Asp Val Thr Thr Thr Ser Thr Arg Ile Gly Lys Phe Glu Ala Arg Phe 205 210 215 ttc cat ttg gcc ttt gaa gaa gag ttt gga aga gtc aag ggt cac ttt 842 Phe His Leu Ala Phe Glu Glu Glu Phe Gly Arg Val Lys Gly His Phe 220 225 230 235 gga cct atc aac agt gtt gcc ttc cat cct gat ggc aag agc tac agc 890 Gly Pro Ile Asn Ser Val Ala Phe His Pro Asp Gly Lys Ser Tyr Ser 240 245 250 agc ggc ggc gaa gat ggt tac gtc cgt atc cat tac ttc gac cca cag 938 Ser Gly Gly Glu Asp Gly Tyr Val Arg Ile His Tyr Phe Asp Pro Gln 255 260 265 tac ttc gaa ttt gag ttt gag gct taa gaagc tggatctcct gccgggcgtg 990 Tyr Phe Glu Phe Glu Phe Glu Ala * 270 275 gtggctcatg cctgtaatcc caccactttt tttttaaggc aggcggatca cctgaggtca 1050 ggagtttaag accagcctga ccaacatgga gaaacctcgt ctctactaaa aatacaaaaa 1110 tacaaaaatt agccaggcat ggtggcacac gcctatagtc ccagctactc aggaggctga 1170 ggcaggagaa tcacttgaac ccaggaggca gaggttgcag tgagctgaga tcacgtcatt 1230 gcactccatc ctgagccaca agagcaaaac tccgtctcaa aaaaaaaaaa 1280 30 1374 DNA Homo sapiens CDS (106)..(1188) misc_feature (1)...(1374) n = a,t,c or g 30 aattgaaccc cctcgcatgc ggtaccggtc cggaattccc gggtcgaccc acgcgtccgc 60 ggacgcgtcc gcccacgcgt ccgccttcct cgcgtcacag ccggg atg aag ccg 114 Met Lys Pro 1 atc cta ctg cag ggc cat gag cgg tcc att acg cag att aag tat aac 162 Ile Leu Leu Gln Gly His Glu Arg Ser Ile Thr Gln Ile Lys Tyr Asn 5 10 15 cgc gaa gga gac ctc ctc ttt act gtg gcc aag gac cct atc gtc aat 210 Arg Glu Gly Asp Leu Leu Phe Thr Val Ala Lys Asp Pro Ile Val Asn 20 25 30 35 gta tgg tac tct gtg aat ggt gag agg ctg ggc acc tac atg ggc cat 258 Val Trp Tyr Ser Val Asn Gly Glu Arg Leu Gly Thr Tyr Met Gly His 40 45 50 acc gga gct gtg tgg tgt gtg gac gct gac tgg gac acc aag cat gtc 306 Thr Gly Ala Val Trp Cys Val Asp Ala Asp Trp Asp Thr Lys His Val 55 60 65 ctc act ggc tca gct gac aac agc tgt cgt ctc tgg gac tgt gaa aca 354 Leu Thr Gly Ser Ala Asp Asn Ser Cys Arg Leu Trp Asp Cys Glu Thr 70 75 80 gga aag cag ctg gcc ctt ctc aag acc aat tcg gct gtc cgg acc tgc 402 Gly Lys Gln Leu Ala Leu Leu Lys Thr Asn Ser Ala Val Arg Thr Cys 85 90 95 ggt ttt gac ttt ggg ggc aac atc atc atg ttc tcc acg gac aag cag 450 Gly Phe Asp Phe Gly Gly Asn Ile Ile Met Phe Ser Thr Asp Lys Gln 100 105 110 115 atg ggc tac cag tgc ttt gtg agc ttt ttt gac ctg cgg gat ccg agc 498 Met Gly Tyr Gln Cys Phe Val Ser Phe Phe Asp Leu Arg Asp Pro Ser 120 125 130 cag att gac aac aat gag ccc tac atg aag atc cct tgc aat gac tct 546 Gln Ile Asp Asn Asn Glu Pro Tyr Met Lys Ile Pro Cys Asn Asp Ser 135 140 145 aaa atc acc agt gct gtt tgg gga ccc ctg ggg gag tgc atc atc gct 594 Lys Ile Thr Ser Ala Val Trp Gly Pro Leu Gly Glu Cys Ile Ile Ala 150 155 160 ggc cat gag agt gga gag ctc aac cag tat agt gcc aag tct gga gag 642 Gly His Glu Ser Gly Glu Leu Asn Gln Tyr Ser Ala Lys Ser Gly Glu 165 170 175 gtg ttg gtg aat gtt aag gag cac tcc cgg cag atc aac gac atc cag 690 Val Leu Val Asn Val Lys Glu His Ser Arg Gln Ile Asn Asp Ile Gln 180 185 190 195 tta tcc agg gac atg acc atg ttt gtg acc gcg tcc aag gac aac aca 738 Leu Ser Arg Asp Met Thr Met Phe Val Thr Ala Ser Lys Asp Asn Thr 200 205 210 gcc aag ctt ttt gac tcc aca act ctt gaa cat cag aag act ttc cgg 786 Ala Lys Leu Phe Asp Ser Thr Thr Leu Glu His Gln Lys Thr Phe Arg 215 220 225 aca gaa cgt cct gtc aac tca gct gcc ctc tcc ccc aac tat gac cat 834 Thr Glu Arg Pro Val Asn Ser Ala Ala Leu Ser Pro Asn Tyr Asp His 230 235 240 gtg gtc ctg ggc ggt ggt cag gaa gcc atg gat gta acc aca acc tcc 882 Val Val Leu Gly Gly Gly Gln Glu Ala Met Asp Val Thr Thr Thr Ser 245 250 255 acc agg att ggc aag ttt gag gcc agg ttc ttc cat ttg gct ttg aag 930 Thr Arg Ile Gly Lys Phe Glu Ala Arg Phe Phe His Leu Ala Leu Lys 260 265 270 275 aag agt ttg gaa gag tca agg gtc act ttg gac cta tca aca gtg ttg 978 Lys Ser Leu Glu Glu Ser Arg Val Thr Leu Asp Leu Ser Thr Val Leu 280 285 290 cct tcc atc ctg atg gca aga gct aca gca gcg gcg gcg aag atg gtt 1026 Pro Ser Ile Leu Met Ala Arg Ala Thr Ala Ala Ala Ala Lys Met Val 295 300 305 acg tcc gta tcc att act tcg acc cac agt act tcg aat ttg agt ttg 1074 Thr Ser Val Ser Ile Thr Ser Thr His Ser Thr Ser Asn Leu Ser Leu 310 315 320 agg ctt aag aag ctg gat ctc ctg ccg ggc gtg gtg gct cat gcc tgt 1122 Arg Leu Lys Lys Leu Asp Leu Leu Pro Gly Val Val Ala His Ala Cys 325 330 335 aat ccc acc act ttt ttt ttt aag gca ggc gga tca cct gag gtc agg 1170 Asn Pro Thr Thr Phe Phe Phe Lys Ala Gly Gly Ser Pro Glu Val Arg 340 345 350 355 agt tta aga cca gcc tga ccaaca tgggagaaac ctcgtgttct acttaaaaat 1224 Ser Leu Arg Pro Ala * 360 acaaaaaatt taggccaggg catgggttgg cacacgtcta tagtcccagc tacttgggaa 1284 ctcgggaggc tgaggcggga gaatgacctg agcccgggag gcggagcttn cggtgggcag 1344 agattgcgcc attgcactcc agcctgggcg 1374 31 1614 DNA Homo sapiens CDS (116)..(1465) 31 ggaacccttt tcctgccgta ccggtccgga attcccgggt cgacgatttc gtggaggagc 60 gcgcgggcac agggtgccgc tgaccgaggc gtgcaaagac tccagaattg gaggc atg 118 Met 1 atg aag act ctg ctg ctg ttt gtg ggg ctg ctg ctg acc tgg gag agt 166 Met Lys Thr Leu Leu Leu Phe Val Gly Leu Leu Leu Thr Trp Glu Ser 5 10 15 ggg cag gtc ctg ggg gac cag acg gtc tca gac aat gag ctc cag gaa 214 Gly Gln Val Leu Gly Asp Gln Thr Val Ser Asp Asn Glu Leu Gln Glu 20 25 30 atg tcc aat cag gga agt aag tac gtc aat aag gaa att caa aat gct 262 Met Ser Asn Gln Gly Ser Lys Tyr Val Asn Lys Glu Ile Gln Asn Ala 35 40 45 gtc aac ggg gtg aaa cag ata aag act ctc ata gaa aaa aca aac gaa 310 Val Asn Gly Val Lys Gln Ile Lys Thr Leu Ile Glu Lys Thr Asn Glu 50 55 60 65 gag cgc aag aca ctg ctc agc aac cta gaa gaa gcc aag aag aaa aaa 358 Glu Arg Lys Thr Leu Leu Ser Asn Leu Glu Glu Ala Lys Lys Lys Lys 70 75 80 gag gat gcc cta aat gag acc agg gaa tca gag aca aag ctg aag gag 406 Glu Asp Ala Leu Asn Glu Thr Arg Glu Ser Glu Thr Lys Leu Lys Glu 85 90 95 ctc cca gga gtg tgc aat gag acc atg atg gcc ctc tgg gaa gag tgt 454 Leu Pro Gly Val Cys Asn Glu Thr Met Met Ala Leu Trp Glu Glu Cys 100 105 110 aag ccc tgc ctg aaa cag acc tgc atg aag ttc tac gca cgc gtc tgc 502 Lys Pro Cys Leu Lys Gln Thr Cys Met Lys Phe Tyr Ala Arg Val Cys 115 120 125 aga agt ggc tca ggc ctg gtt ggc cgc cag ctt gag gag ttc ctg aac 550 Arg Ser Gly Ser Gly Leu Val Gly Arg Gln Leu Glu Glu Phe Leu Asn 130 135 140 145 cag agc tcg ccc ttc tac ttc tgg atg aat ggt gac cgc atc gac tcc 598 Gln Ser Ser Pro Phe Tyr Phe Trp Met Asn Gly Asp Arg Ile Asp Ser 150 155 160 ctg ctg gag aac gac cgg cag cag acg cac atg ctg gat gtc atg cag 646 Leu Leu Glu Asn Asp Arg Gln Gln Thr His Met Leu Asp Val Met Gln 165 170 175 gac cac ttc agc cgc gcg tcc agc atc ata gac gag ctc ttc cag gac 694 Asp His Phe Ser Arg Ala Ser Ser Ile Ile Asp Glu Leu Phe Gln Asp 180 185 190 agg ttc ttc acc cgg gag ccc cag gat acc tac cac tac ctg ccc ttc 742 Arg Phe Phe Thr Arg Glu Pro Gln Asp Thr Tyr His Tyr Leu Pro Phe 195 200 205 agc ctg ccc cac cgg agg cct cac ttc ttc ttt ccc aag tcc cgc atc 790 Ser Leu Pro His Arg Arg Pro His Phe Phe Phe Pro Lys Ser Arg Ile 210 215 220 225 gtc cgc agc ttg atg ccc ttc tct ccg tac gag ccc ctg aac ttc cac 838 Val Arg Ser Leu Met Pro Phe Ser Pro Tyr Glu Pro Leu Asn Phe His 230 235 240 gcc atg ttc cag ccc ttc ctt gag atg ata cac gag gct cag cag gcc 886 Ala Met Phe Gln Pro Phe Leu Glu Met Ile His Glu Ala Gln Gln Ala 245 250 255 atg gac atc cac ttc cac agc ccg gcc ttc cag cac ccg cca aca gaa 934 Met Asp Ile His Phe His Ser Pro Ala Phe Gln His Pro Pro Thr Glu 260 265 270 ttc ata cga gaa ggc gac gat gac cgg act gtg tgc cgg gag atc cgc 982 Phe Ile Arg Glu Gly Asp Asp Asp Arg Thr Val Cys Arg Glu Ile Arg 275 280 285 cac aac tcc acg ggc tgc ctg cgg atg aag gac cag tgt gac aag tgc 1030 His Asn Ser Thr Gly Cys Leu Arg Met Lys Asp Gln Cys Asp Lys Cys 290 295 300 305 cgg gag atc ttg tct gtg gac tgt tcc acc aac aac ccc tcc cag gct 1078 Arg Glu Ile Leu Ser Val Asp Cys Ser Thr Asn Asn Pro Ser Gln Ala 310 315 320 aag ctg cgg cgg gag ctc gac gaa tcc ctc cag gtc gct gag agg ttg 1126 Lys Leu Arg Arg Glu Leu Asp Glu Ser Leu Gln Val Ala Glu Arg Leu 325 330 335 acc agg aaa tac aac gag ctg cta aag tcc tac cag tgg aag atg ctc 1174 Thr Arg Lys Tyr Asn Glu Leu Leu Lys Ser Tyr Gln Trp Lys Met Leu 340 345 350 aac acc tcc tcc ttg ctg gag cag ctg aac gag cag ttt aac tgg gtg 1222 Asn Thr Ser Ser Leu Leu Glu Gln Leu Asn Glu Gln Phe Asn Trp Val 355 360 365 tcc cgg ctg gca aac ctc acg caa ggc gaa gac cag tac tat ctg cgg 1270 Ser Arg Leu Ala Asn Leu Thr Gln Gly Glu Asp Gln Tyr Tyr Leu Arg 370 375 380 385 gtc acc acg gtg gct tcc cac act tct gac tcg gac gtt cct tcc ggt 1318 Val Thr Thr Val Ala Ser His Thr Ser Asp Ser Asp Val Pro Ser Gly 390 395 400 gtc act gag gtg gtc gtg aag ctc ttt gac tct gat ccc atc act gtg 1366 Val Thr Glu Val Val Val Lys Leu Phe Asp Ser Asp Pro Ile Thr Val 405 410 415 acg gtc cct gta gaa gtc tcc agg aag aac cct aaa ttt atg gag acc 1414 Thr Val Pro Val Glu Val Ser Arg Lys Asn Pro Lys Phe Met Glu Thr 420 425 430 gtg gcg gag aaa gcg ctg cag gaa tac cgc aaa aag cac cgg gag gag 1462 Val Ala Glu Lys Ala Leu Gln Glu Tyr Arg Lys Lys His Arg Glu Glu 435 440 445 tga gatg tggatgttgc ttttgcacct acgggggcat ctgagtccag ctccccccaa 1519 * 450 gatgagctgc agccccccag agagagctct gcacgtcacc aagtaaccag gccctgtatt 1579 gcggccgctc tgggggttct cccacgacgc aacac 1614 32 2340 DNA Homo sapiens CDS (95)..(1945) 32 cagcccccat tgaaccccat gaggtaccgg tccggaattc ccgggtcgac ccacgcgtcc 60 ggactggcgg gactgcgcgg cggcaacagc agac atg tcg ggg gtc cgg ggc 112 Met Ser Gly Val Arg Gly 1 5 ctg tcg cgg ctg ctg agc gct cgg cgc ctg gcg ctg gcc aag gcg tgg 160 Leu Ser Arg Leu Leu Ser Ala Arg Arg Leu Ala Leu Ala Lys Ala Trp 10 15 20 cca aca gtg ttg caa aca gga acc cga ggt ttt cac ttc act gtt gtt 208 Pro Thr Val Leu Gln Thr Gly Thr Arg Gly Phe His Phe Thr Val Val 25 30 35 ggg aac aag agg gca tct gct aaa gtt tca gat tcc att tct gct cag 256 Gly Asn Lys Arg Ala Ser Ala Lys Val Ser Asp Ser Ile Ser Ala Gln 40 45 50 tat cca gta gtg gat cat gaa ttt gat gca gtg gtg gta ggc gct gga 304 Tyr Pro Val Val Asp His Glu Phe Asp Ala Val Val Val Gly Ala Gly 55 60 65 70 ggg gca ggc ttg cga gct gca ttt ggc ctt tct gag gca ggg ttt aat 352 Gly Ala Gly Leu Arg Ala Ala Phe Gly Leu Ser Glu Ala Gly Phe Asn 75 80 85 aca gca tgt gtt acc aag ctg ttt cct acc agg tca cac act gtt gca 400 Thr Ala Cys Val Thr Lys Leu Phe Pro Thr Arg Ser His Thr Val Ala 90 95 100 gca cag cta gaa aat tat ggc atg ccg ttt agc aga act gaa gat ggg 448 Ala Gln Leu Glu Asn Tyr Gly Met Pro Phe Ser Arg Thr Glu Asp Gly 105 110 115 aag att tat cag cgt gca ttt ggt gga cag agc ctc aag ttt gga aag 496 Lys Ile Tyr Gln Arg Ala Phe Gly Gly Gln Ser Leu Lys Phe Gly Lys 120 125 130 ggc ggg cag gcc cat cgg tgc tgc tgt gtg gct gat cgg act ggc cac 544 Gly Gly Gln Ala His Arg Cys Cys Cys Val Ala Asp Arg Thr Gly His 135 140 145 150 tcg cta ttg cac acc tta tat gga cgg tct ctg cga tat gat acc agc 592 Ser Leu Leu His Thr Leu Tyr Gly Arg Ser Leu Arg Tyr Asp Thr Ser 155 160 165 tat ttt gtg gag tat ttt gcc ttg gat ctc ctg atg gag aac ggg gag 640 Tyr Phe Val Glu Tyr Phe Ala Leu Asp Leu Leu Met Glu Asn Gly Glu 170 175 180 tgc cgt ggt gtc atc gca ctg tgc ata gag gac ggg tcc atc cat cgc 688 Cys Arg Gly Val Ile Ala Leu Cys Ile Glu Asp Gly Ser Ile His Arg 185 190 195 ata aga gca aag aac act gtt gtt gcc aca gga ggc tac ggg cgc acc 736 Ile Arg Ala Lys Asn Thr Val Val Ala Thr Gly Gly Tyr Gly Arg Thr 200 205 210 tac ttc agc tgc acg tct gcc cac acc agc act ggc gac ggc acg gcc 784 Tyr Phe Ser Cys Thr Ser Ala His Thr Ser Thr Gly Asp Gly Thr Ala 215 220 225 230 atg atc acc agg gca ggc ctt cct tgc cag gac cta gag ttt gtt cag 832 Met Ile Thr Arg Ala Gly Leu Pro Cys Gln Asp Leu Glu Phe Val Gln 235 240 245 ttc cac ccc aca ggc ata tat ggt gct ggt tgt ctc att acg gaa gga 880 Phe His Pro Thr Gly Ile Tyr Gly Ala Gly Cys Leu Ile Thr Glu Gly 250 255 260 tgt cgt gga gag gga ggc att ctc att aac agt caa ggc gaa agg ttt 928 Cys Arg Gly Glu Gly Gly Ile Leu Ile Asn Ser Gln Gly Glu Arg Phe 265 270 275 atg gag cga tac gcc cct gtc gcg aag gac ctg gcg tct aga gat gtg 976 Met Glu Arg Tyr Ala Pro Val Ala Lys Asp Leu Ala Ser Arg Asp Val 280 285 290 gtg tct cgg tcc atg act ctg gag atc cga gaa gga aga ggc tgt ggc 1024 Val Ser Arg Ser Met Thr Leu Glu Ile Arg Glu Gly Arg Gly Cys Gly 295 300 305 310 cct gag aaa gat cac gtc tac ctg cag ctg cac cac cta cct cca gag 1072 Pro Glu Lys Asp His Val Tyr Leu Gln Leu His His Leu Pro Pro Glu 315 320 325 cag ctg gcc acg cgc ctg cct ggc att tca gag aca gcc atg atc ttc 1120 Gln Leu Ala Thr Arg Leu Pro Gly Ile Ser Glu Thr Ala Met Ile Phe 330 335 340 gct ggc gtg gac gtc acg aag gag ccg atc cct gtc ctc ccc acc gtg 1168 Ala Gly Val Asp Val Thr Lys Glu Pro Ile Pro Val Leu Pro Thr Val 345 350 355 cat tat aac atg ggc ggc att ccc acc aac tac aag ggg cag gtc ctg 1216 His Tyr Asn Met Gly Gly Ile Pro Thr Asn Tyr Lys Gly Gln Val Leu 360 365 370 agg cac gtg aat ggc cag gat cag att gtg ccc ggc ctg tac gcc tgt 1264 Arg His Val Asn Gly Gln Asp Gln Ile Val Pro Gly Leu Tyr Ala Cys 375 380 385 390 ggg gag gcc gcc tgt gcc tcg gta cat ggt gcc aac cgc ctc ggg gca 1312 Gly Glu Ala Ala Cys Ala Ser Val His Gly Ala Asn Arg Leu Gly Ala 395 400 405 aac tcg ctc ttg gac ctg gtt gtc ttt ggt cgg gca tgt gcc ctg agc 1360 Asn Ser Leu Leu Asp Leu Val Val Phe Gly Arg Ala Cys Ala Leu Ser 410 415 420 atc gaa gag tca tgc agg cct gga gat aaa gtc cct cca att aaa cca 1408 Ile Glu Glu Ser Cys Arg Pro Gly Asp Lys Val Pro Pro Ile Lys Pro 425 430 435 aac gct ggg gaa gaa tct gtc atg aat ctt gac aaa ttg aga ttt gct 1456 Asn Ala Gly Glu Glu Ser Val Met Asn Leu Asp Lys Leu Arg Phe Ala 440 445 450 gat gga agc ata aga aca tcg gaa ctg cga ctc agc atg cag aag tca 1504 Asp Gly Ser Ile Arg Thr Ser Glu Leu Arg Leu Ser Met Gln Lys Ser 455 460 465 470 atg caa aat cat gct gcc gtg ttc cgt gtg gga agc gtg ttg caa gaa 1552 Met Gln Asn His Ala Ala Val Phe Arg Val Gly Ser Val Leu Gln Glu 475 480 485 ggt tgt ggg aaa atc agc aag ctc tat gga gac cta aag cac ctg aag 1600 Gly Cys Gly Lys Ile Ser Lys Leu Tyr Gly Asp Leu Lys His Leu Lys 490 495 500 acg ttc gac cgg gga atg gtc tgg aac acg gac ctg gtg gag acc ctg 1648 Thr Phe Asp Arg Gly Met Val Trp Asn Thr Asp Leu Val Glu Thr Leu 505 510 515 gag ctg cag aac ctg atg ctg tgt gcg ctg cag acc atc tac gga gca 1696 Glu Leu Gln Asn Leu Met Leu Cys Ala Leu Gln Thr Ile Tyr Gly Ala 520 525 530 gag gca cgg aag gag tca cgg ggc gcg cat gcc agg gaa gac tac aag 1744 Glu Ala Arg Lys Glu Ser Arg Gly Ala His Ala Arg Glu Asp Tyr Lys 535 540 545 550 gtg cgg att gat gag tac gat tac tcc aag ccc atc cag ggg caa cag 1792 Val Arg Ile Asp Glu Tyr Asp Tyr Ser Lys Pro Ile Gln Gly Gln Gln 555 560 565 aag aag ccc ttt gag gag cac tgg agg aag cac acc ctg tcc tat gtg 1840 Lys Lys Pro Phe Glu Glu His Trp Arg Lys His Thr Leu Ser Tyr Val 570 575 580 gac gtt ggc act ggg aag gtc act ctg gaa tat aga ccc gtg atc gac 1888 Asp Val Gly Thr Gly Lys Val Thr Leu Glu Tyr Arg Pro Val Ile Asp 585 590 595 aaa act ttg aac gag gct gac tgt gcc acc gtc ccg cca gcc att cgc 1936 Lys Thr Leu Asn Glu Ala Asp Cys Ala Thr Val Pro Pro Ala Ile Arg 600 605 610 tcc tac tga tgagaca agatgtggtg atgacagaat cagcttttgt aattatgtat 1992 Ser Tyr * 615 aatagctcat gcatgtgtcc atgtcataac tgtcttcata cgcttctgca ctctggggaa 2052 gaaggagtac attgaaggga gattggcacc tagtggctgg gagcttgcca ggaacccagt 2112 ggccagggag cgtggcactt acctttgtcc cttgcttcat tcttgtgaga tgataaaact 2172 gggcacagct cttaaataaa atataaatga acaaactttc ttttatttcc aaatccattt 2232 gaaatatttt actgttgtga ctttagtcat atttgttgac ctaaaaatca aatgtaatct 2292 ttgtattgtg ttacatcaaa atccagatat tttgtatagt ttcttttt 2340 33 997 DNA Homo sapiens CDS (53)..(853) misc_feature (1)...(997) n = a,t,c or g 33 accggnccgg aattcccggg tcgacgattt cgtcgcggcg gcaacagcag ac atg 55 Met 1 tcg ggg gtc cgg ggc ctg tcg cgg ctg ctg agc gct cgg cgc ctg gcg 103 Ser Gly Val Arg Gly Leu Ser Arg Leu Leu Ser Ala Arg Arg Leu Ala 5 10 15 ctg gcc aag gcg tgg cca aca gtg ttg caa aca gga acc cga ggt ttt 151 Leu Ala Lys Ala Trp Pro Thr Val Leu Gln Thr Gly Thr Arg Gly Phe 20 25 30 cac ttc act gtt gat ggg aac aag agg gca tct gct aaa gtt tca gat 199 His Phe Thr Val Asp Gly Asn Lys Arg Ala Ser Ala Lys Val Ser Asp 35 40 45 tcc att tct gct cag tat cca gta gtg gat cat gaa ttt gat gca gtg 247 Ser Ile Ser Ala Gln Tyr Pro Val Val Asp His Glu Phe Asp Ala Val 50 55 60 65 gtg gta ggc gct gga ggg gca ggc ttg cga gct gca ttt ggc ctt tct 295 Val Val Gly Ala Gly Gly Ala Gly Leu Arg Ala Ala Phe Gly Leu Ser 70 75 80 gag gca ggg ttt aat aca gca tgt gtt acc aag ctg ttt cct acc agg 343 Glu Ala Gly Phe Asn Thr Ala Cys Val Thr Lys Leu Phe Pro Thr Arg 85 90 95 tca cac act gtt gca gca cag gga gga atc aat gct gct ctg ggg aac 391 Ser His Thr Val Ala Ala Gln Gly Gly Ile Asn Ala Ala Leu Gly Asn 100 105 110 atg gag gag gac aac tgg agg tgg cat ttc tac gac acc gtg aag ggc 439 Met Glu Glu Asp Asn Trp Arg Trp His Phe Tyr Asp Thr Val Lys Gly 115 120 125 tcc gac tgg ctg ggg gac cag gat gcc atc cac tac atg acg gag cag 487 Ser Asp Trp Leu Gly Asp Gln Asp Ala Ile His Tyr Met Thr Glu Gln 130 135 140 145 gcc ccc gcc gcc gtg gtc gag cta gaa aat tat ggc atg ccg ttt agc 535 Ala Pro Ala Ala Val Val Glu Leu Glu Asn Tyr Gly Met Pro Phe Ser 150 155 160 aga act gaa gat ggg aag att tat cag cgt gca ttt ggt gga cag agc 583 Arg Thr Glu Asp Gly Lys Ile Tyr Gln Arg Ala Phe Gly Gly Gln Ser 165 170 175 ctc aag ttt gga aag ggc ggg cag gcc cat cgg tgc tgc tgt gtg gct 631 Leu Lys Phe Gly Lys Gly Gly Gln Ala His Arg Cys Cys Cys Val Ala 180 185 190 gat cgg act ggc cac tcg cta ttg cac acc tta tat gga agg tct ctg 679 Asp Arg Thr Gly His Ser Leu Leu His Thr Leu Tyr Gly Arg Ser Leu 195 200 205 cga tat gat acc agc tat ttt gtg gag tat ttt gcc ttg gat ctc ctg 727 Arg Tyr Asp Thr Ser Tyr Phe Val Glu Tyr Phe Ala Leu Asp Leu Leu 210 215 220 225 atg gag aat ggg gag tgc cgt ggt gtc atc gca ctg tgc ata gaa gtc 775 Met Glu Asn Gly Glu Cys Arg Gly Val Ile Ala Leu Cys Ile Glu Val 230 235 240 gac gcg gcc gcg aat tcg gat cct cga gag atc tct ttt ttt ggg ttt 823 Asp Ala Ala Ala Asn Ser Asp Pro Arg Glu Ile Ser Phe Phe Gly Phe 245 250 255 ggt ggg gta tct tca tca tcg aat aga tag t tgtatacatc agccgtccaa 874 Gly Gly Val Ser Ser Ser Ser Asn Arg * 260 265 acttagtaat cgtccacata atcgtgagat gcacgtgatc tcactatatg acagtgatct 934 tgataacgca cacatcagtg gattgcatat ataattagta gcgtctcatg ctttacagat 994 gca 997 34 2247 DNA Homo sapiens CDS (158)..(607) 34 aaggatcctt aattaaatta atcccccccc cccccgggcg cgcgctgcgg gcagtgagtg 60 tggaggcgcg gacgcgcggc ggagctggaa ctgctgcagc tgctgccgcc gccggaggaa 120 ccttgatccc cgtgctccgg acaccccggg cctcgcc atg gct gac cag ctg act 175 Met Ala Asp Gln Leu Thr 1 5 gag gag cag att gca gag ttc aag gag gcc ttc tcc ctc ttt gac aag 223 Glu Glu Gln Ile Ala Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp Lys 10 15 20 gat gga gat ggc act atc acc acc aag gag ttg ggg aca gtg atg aga 271 Asp Gly Asp Gly Thr Ile Thr Thr Lys Glu Leu Gly Thr Val Met Arg 25 30 35 tcc ctg gga cag aac ccc act gaa gca gag ctg cag gat atg atc aat 319 Ser Leu Gly Gln Asn Pro Thr Glu Ala Glu Leu Gln Asp Met Ile Asn 40 45 50 gag gtg gat gca gat ggg aac ggg acc att gac ttc ccg gag ttc ctg 367 Glu Val Asp Ala Asp Gly Asn Gly Thr Ile Asp Phe Pro Glu Phe Leu 55 60 65 70 acc atg atg gcc aga aag atg aag gac aca gac agt gag gag gag atc 415 Thr Met Met Ala Arg Lys Met Lys Asp Thr Asp Ser Glu Glu Glu Ile 75 80 85 cga gag gcg ttc cgt gtc ttt gac aag gat ggg aat ggc tac atc agc 463 Arg Glu Ala Phe Arg Val Phe Asp Lys Asp Gly Asn Gly Tyr Ile Ser 90 95 100 gcc gca gag ctg cgt cac gta atg acg aac ctg ggg gag aag ctg acc 511 Ala Ala Glu Leu Arg His Val Met Thr Asn Leu Gly Glu Lys Leu Thr 105 110 115 gat gag gag gtg gat gag atg atc agg gag gct gac atc gat gga gat 559 Asp Glu Glu Val Asp Glu Met Ile Arg Glu Ala Asp Ile Asp Gly Asp 120 125 130 ggc cag gtc aat tat gaa gag ttt gta cag atg atg act gca aag tga 607 Gly Gln Val Asn Tyr Glu Glu Phe Val Gln Met Met Thr Ala Lys * 135 140 145 aggccccccg ggcagctggc gatgcccgtt ctcttgatct ctctcttctc gcgcgcgcac 667 tctctcttca acactcccct gcgtaccccg gttctagcaa acaccaattg attgactgag 727 aatctgataa agcaacaaaa gatttgtccc aagctgcatg attgctcttt ctccttcttc 787 cctgagtctc tctccatgcc cctcatctct tccttttgcc ctcgcctctt ccatccatgt 847 cttccaaggc ctgatgcatt cataagttga agccctcccc agatcccctt ggggagcctc 907 tgccctcctc cagcccggat ggctctcctc cattttggtt tgtttcctct tgtttgtcat 967 cttattttgg gtgctggggt ggctgccagc cctgtcccgg gacctgctgg gagggacaag 1027 aggccctccc ccaggcagaa gagcatgccc tttgccgttg catgcaacca gccctgtgat 1087 tccacgtgca gatcccagca gcctgttggg gcaggggtgc caagagaggc attccagaag 1147 gactgagggg gcgttgagga attgtggcgt tgactggatg tggcccagga gggggtcgag 1207 ggggccaact cacagaaggg gactgacagt gggcaacact cacatcccac tggctgctgt 1267 tctgaaacca tctgattggc tttctgaggt ttggctgggt ggggactgct catttggcca 1327 ctctgcagat tggacttgcc cgcgttcctg aagcgctctc gagctgttct gtaaatacct 1387 ggtgctaaca tcccatgccg ctccctcctc acgatgcacc caccgccctg agggcccgtc 1447 ctaggaatgg atgtggggat ggtcgctttg taatgtgctg gttctctttt tttttctttc 1507 ccctctatgg cccttaagac tttcattttg ttcagaacca tgctgggcta gctaaagggt 1567 ggggagaggg aagatgggcc ccaccacgct ctcaagagaa cgcacctgca ataaaacagt 1627 cttgtcggcc agctgcccag gggacggcag ctacagcagc ctctgcgtcc tggtccgcca 1687 gcacctcccg cttctccgtg gtgacttggc gccgcttcct cacatctgtg ctccgtgccc 1747 tcttccctgc ctcttccctc gcccacctgc ctgcccccat actcccccag cggagagcat 1807 gatccgtgcc cttgcttctg actttcgcct ctgggacaag taagtcaatg tgggcagttc 1867 agtcgtctgg gttttttccc cttttctgtt catttcatct ggctcccccc accacctccc 1927 caccccaccc cccaccccct gcttcccctc actgcccagg tcgatcaagt ggcttttcct 1987 gggacctgcc cagctttgag aatctcttct catccaccct ctggcaccca gcctctgagg 2047 gaaggaggga tggggcatag tgggagaccc agccaagagc tgagggtaag ggcaggtagg 2107 cgtgaggctg tggacatttt cggaatgttt tggttttgtt ttttttaaac cgggcaatat 2167 tgtgttcagt tcaagctgtg aagaaaaata tatatcaatg ttttccaata aaatacagtg 2227 actacctgaa aaaaaaaaaa 2247 35 1550 DNA Homo sapiens CDS (129)..(1220) 35 cgcacgtacc ggtccggaat acccggatcg acgctttcgt ggaagaagga agaagagggt 60 agaggaggag agggaggagg aggagggagg tggcggcgcc gtggcggagg agcaggagca 120 ggaggggg atg gag agg aga agg ctc ctg ggt ggc atg gcg ctc ctg ctc 170 Met Glu Arg Arg Arg Leu Leu Gly Gly Met Ala Leu Leu Leu 1 5 10 ctc cag gcg ctg ccc agc ccc ttg tca gcc agg gct gaa ccc ccg cag 218 Leu Gln Ala Leu Pro Ser Pro Leu Ser Ala Arg Ala Glu Pro Pro Gln 15 20 25 30 gat aag gaa gcc tgt gtg ggt acc aac aat caa agc tac atc tgt gac 266 Asp Lys Glu Ala Cys Val Gly Thr Asn Asn Gln Ser Tyr Ile Cys Asp 35 40 45 aca gga cac tgc tgt gga cag tct cag tgc tgc aac tac tac tat gaa 314 Thr Gly His Cys Cys Gly Gln Ser Gln Cys Cys Asn Tyr Tyr Tyr Glu 50 55 60 ctc tgg tgg ttc tgg ctg gtg tgg acc atc atc atc atc ctg agc tgc 362 Leu Trp Trp Phe Trp Leu Val Trp Thr Ile Ile Ile Ile Leu Ser Cys 65 70 75 tgc tgt gtt tgc cac cac cgc cga gcc aag cac cgc ctt cag gcc cag 410 Cys Cys Val Cys His His Arg Arg Ala Lys His Arg Leu Gln Ala Gln 80 85 90 cag cgg caa cat gaa atc aac ctg atc gct tac cga gaa gcc cac aat 458 Gln Arg Gln His Glu Ile Asn Leu Ile Ala Tyr Arg Glu Ala His Asn 95 100 105 110 tac tca gcg ctg cca ttt tat ttc agg ttt ttg cca aac tat tta cta 506 Tyr Ser Ala Leu Pro Phe Tyr Phe Arg Phe Leu Pro Asn Tyr Leu Leu 115 120 125 cct cct tat gag gaa gtg gtg aac cga cct cca act cct ccc cca cca 554 Pro Pro Tyr Glu Glu Val Val Asn Arg Pro Pro Thr Pro Pro Pro Pro 130 135 140 tac agt gcc ttc cag cta cag cag cag cag ctg ctg cct cca cag tgt 602 Tyr Ser Ala Phe Gln Leu Gln Gln Gln Gln Leu Leu Pro Pro Gln Cys 145 150 155 ggc cct gca ggt ggc agt ccc ccg ggc atc gat ccc acc agg gga tcc 650 Gly Pro Ala Gly Gly Ser Pro Pro Gly Ile Asp Pro Thr Arg Gly Ser 160 165 170 cag ggg gca cag agc agc ccc ttg tct gag ccc agc aga agc agc aca 698 Gln Gly Ala Gln Ser Ser Pro Leu Ser Glu Pro Ser Arg Ser Ser Thr 175 180 185 190 aga ccc cca agc atc gct gac cct gat ccc tct gac cta cca gtt gac 746 Arg Pro Pro Ser Ile Ala Asp Pro Asp Pro Ser Asp Leu Pro Val Asp 195 200 205 cga gca gcc acc aaa gcc cca ggg atg gag ccc agt ggc tct gtg gct 794 Arg Ala Ala Thr Lys Ala Pro Gly Met Glu Pro Ser Gly Ser Val Ala 210 215 220 ggc ctg ggg gag ctg gac ccg ggg gcc ttc ctg gac aaa gat gca gaa 842 Gly Leu Gly Glu Leu Asp Pro Gly Ala Phe Leu Asp Lys Asp Ala Glu 225 230 235 tgt agg gag gag ctg ctg aaa gat gac agc tct gaa cac ggc gca ccc 890 Cys Arg Glu Glu Leu Leu Lys Asp Asp Ser Ser Glu His Gly Ala Pro 240 245 250 gac agc aaa gag aag acg cct ggg aga cat cgc cgc ttc aca ggt gac 938 Asp Ser Lys Glu Lys Thr Pro Gly Arg His Arg Arg Phe Thr Gly Asp 255 260 265 270 tcg ggc att gaa gtg tgt gtg tgc aac cgg ggc cac cat gac gat gac 986 Ser Gly Ile Glu Val Cys Val Cys Asn Arg Gly His His Asp Asp Asp 275 280 285 ctc aaa gag ttc aac aca ctc atc gat gat gct ctg gat ggg ccc ctg 1034 Leu Lys Glu Phe Asn Thr Leu Ile Asp Asp Ala Leu Asp Gly Pro Leu 290 295 300 gac ttc tgc gac agc tgc cat gtg cgg ccc cct ggt gat gag gag gaa 1082 Asp Phe Cys Asp Ser Cys His Val Arg Pro Pro Gly Asp Glu Glu Glu 305 310 315 ggc ctc tgt cag ccc tct gag gag cag gct cga gag cct ggg cac ccg 1130 Gly Leu Cys Gln Pro Ser Glu Glu Gln Ala Arg Glu Pro Gly His Pro 320 325 330 cac ctg cca cgg ccg ccc gca tgc ctg ctg ctg aac acc atc aac gag 1178 His Leu Pro Arg Pro Pro Ala Cys Leu Leu Leu Asn Thr Ile Asn Glu 335 340 345 350 cag gac tct ccc aac tcc cag agc agc agc tcc ccc agc tag agcaggt 1227 Gln Asp Ser Pro Asn Ser Gln Ser Ser Ser Ser Pro Ser * 355 360 cctgccagca cccagcaact tggcaaagca accagggtag gggagaacca cgagagaagc 1287 attaagtgac tttcaaagac tttcagagta cagccacttg gttccttttt gtttgttttc 1347 cttctcctct cctgcatttt cctccatctc caggtacagt tcggggtgtg gatgcctctt 1407 cctccacaag ggcacagtgt tgtggagggc taagttggtt ctgtgactca ttcctcatac 1467 cctaactcca tctcctttct ttaaagtcaa atctcaccta cctgtttggg tcagagagat 1527 gtgttttaaa agcccccaag gaa 1550 36 1517 DNA Homo sapiens CDS (62)..(1165) 36 taccggtccg gaattcccgg gtcgacgatt tcgtctttcc tgcctctgat tccgggctgt 60 c atg gcg acc ccc aac aat ctg acc ccc acc aac tgc agc tgg tgg 106 Met Ala Thr Pro Asn Asn Leu Thr Pro Thr Asn Cys Ser Trp Trp 1 5 10 15 ccc atc tcc gcg ctg gag agc gat gcg gcc aag cca gcg gag gcc ccc 154 Pro Ile Ser Ala Leu Glu Ser Asp Ala Ala Lys Pro Ala Glu Ala Pro 20 25 30 gac gct ccc gag gcg gcc agc ccc gcc cat tgg ccc agg gag agc ctg 202 Asp Ala Pro Glu Ala Ala Ser Pro Ala His Trp Pro Arg Glu Ser Leu 35 40 45 gtt ctg tac cac tgg acc cag tcc ttc agc tcg cag aag gtg cgg ctg 250 Val Leu Tyr His Trp Thr Gln Ser Phe Ser Ser Gln Lys Val Arg Leu 50 55 60 gtg atc gcc gag aag ggc ctg gtg tgc gag gag cgg gac gtg agc ctg 298 Val Ile Ala Glu Lys Gly Leu Val Cys Glu Glu Arg Asp Val Ser Leu 65 70 75 cca cag agc gag cac aag gag ccc tgg ttc atg cgg ctc aac ctg ggc 346 Pro Gln Ser Glu His Lys Glu Pro Trp Phe Met Arg Leu Asn Leu Gly 80 85 90 95 gag gag gtg ccc gtc atc atc cac cgc gac aac atc atc agt gac tat 394 Glu Glu Val Pro Val Ile Ile His Arg Asp Asn Ile Ile Ser Asp Tyr 100 105 110 gac cag atc att gac tat gtg gag cgc acc ttc aca gga gag cac gtg 442 Asp Gln Ile Ile Asp Tyr Val Glu Arg Thr Phe Thr Gly Glu His Val 115 120 125 gtg gcc ctg atg ccc gag gtg ggc agc ctg cag cac gca cgg gtg ctg 490 Val Ala Leu Met Pro Glu Val Gly Ser Leu Gln His Ala Arg Val Leu 130 135 140 cag tac cgg gag ctg ctg gac gca ctg ccc atg gat gcc tac acg cat 538 Gln Tyr Arg Glu Leu Leu Asp Ala Leu Pro Met Asp Ala Tyr Thr His 145 150 155 ggc tgc atc ctg cat ccc gag ctc acc acc gac tcc atg atc ccc aag 586 Gly Cys Ile Leu His Pro Glu Leu Thr Thr Asp Ser Met Ile Pro Lys 160 165 170 175 tac gcc acg gcc gag atc cgc aga cat tta gcc aat gcc acc acg gac 634 Tyr Ala Thr Ala Glu Ile Arg Arg His Leu Ala Asn Ala Thr Thr Asp 180 185 190 ctc atg aaa ctg gac cat gaa gag gag ccc cag ctc tcc gag ccc tac 682 Leu Met Lys Leu Asp His Glu Glu Glu Pro Gln Leu Ser Glu Pro Tyr 195 200 205 ctt tct aaa caa aag aag ctc atg gcc aag atc ttg gag cat gat gat 730 Leu Ser Lys Gln Lys Lys Leu Met Ala Lys Ile Leu Glu His Asp Asp 210 215 220 gtg agc tac ctg aag aag atc ctc ggg gaa ctg gcc atg gtg ctg gac 778 Val Ser Tyr Leu Lys Lys Ile Leu Gly Glu Leu Ala Met Val Leu Asp 225 230 235 cag att gag gcg gag ctg gag aag agg aag ctg gag aac gag ggg cag 826 Gln Ile Glu Ala Glu Leu Glu Lys Arg Lys Leu Glu Asn Glu Gly Gln 240 245 250 255 aaa tgc gag ctg tgg ctc tgt ggc tgt gcc ttc acc ctc gct gat gtc 874 Lys Cys Glu Leu Trp Leu Cys Gly Cys Ala Phe Thr Leu Ala Asp Val 260 265 270 ctc ctg gga gcc acc ctg cac cgc ctc aag ttc ctg gga ctg tcc aag 922 Leu Leu Gly Ala Thr Leu His Arg Leu Lys Phe Leu Gly Leu Ser Lys 275 280 285 aaa tac tgg gaa gat ggc agc cgg ccc aac ctg cag tcc ttc ttt gag 970 Lys Tyr Trp Glu Asp Gly Ser Arg Pro Asn Leu Gln Ser Phe Phe Glu 290 295 300 agg gtc cag aga cgc ttt gcc ttc cgg aaa gtc ctg ggt gac atc cac 1018 Arg Val Gln Arg Arg Phe Ala Phe Arg Lys Val Leu Gly Asp Ile His 305 310 315 acc acc ctg ctg tcg gcc gtc atc ccc aat gct ttc cgg ctg gtc aag 1066 Thr Thr Leu Leu Ser Ala Val Ile Pro Asn Ala Phe Arg Leu Val Lys 320 325 330 335 agg aaa ccc cca tcc ttc ttc ggg gcg tcc ttc ctc atg ggc tcc ctg 1114 Arg Lys Pro Pro Ser Phe Phe Gly Ala Ser Phe Leu Met Gly Ser Leu 340 345 350 ggt ggg atg ggc tac ttt gcc tac tgg tac ctc aag aaa aaa tac atc 1162 Gly Gly Met Gly Tyr Phe Ala Tyr Trp Tyr Leu Lys Lys Lys Tyr Ile 355 360 365 tag ggcc aggcctgggg cttggtgtct gactgtcggt gtctctgtgc tgtgtgattc 1219 * 368 cccgtgagct ctcagtaact cactgtctca tgaacacttg gacagccctc cccgcccttc 1279 gttctgagta ataataccgt cagtgtgaaa acattccgta gtttagaagt agacgttgcc 1339 aatgctgtga ctcaaggcca cggctctgct aaaagagaga gaggaagcga gagagagaga 1399 gaaaaaacaa aaaacagaaa acacgaatgc cttttctatc gattcaaggt ctcaagatgg 1459 aacttgtgga gactggttag gatctgaggg aactcttcca caggacaaag ccgacatc 1517 37 9098 DNA Homo sapiens CDS (75)..(9098) 37 gcacgagggt aggcttctgg gagcgaccgc tccgctcgtc tcgttggttc cggaggtcgc 60 tgcggcggtg ggaa atg ctg gcg cgc gcg gcg cgg ggc act ggg gcc ctt 110 Met Leu Ala Arg Ala Ala Arg Gly Thr Gly Ala Leu 1 5 10 ttg ctg agg ggc tct cta ctg gct tct ggc cgc gct ccg cgc cgc gcc 158 Leu Leu Arg Gly Ser Leu Leu Ala Ser Gly Arg Ala Pro Arg Arg Ala 15 20 25 tcc tct gga ttg ccc cga aac acc gtg gta ctg ttc gtg ccg cag cag 206 Ser Ser Gly Leu Pro Arg Asn Thr Val Val Leu Phe Val Pro Gln Gln 30 35 40 gag gcc tgg gtg gtg gag cga atg ggc cga ttc cac cgg atc ctg gag 254 Glu Ala Trp Val Val Glu Arg Met Gly Arg Phe His Arg Ile Leu Glu 45 50 55 60 cct ggt ttg aac atc ctc atc cct gtg tta gac cgg atc cga tat gtg 302 Pro Gly Leu Asn Ile Leu Ile Pro Val Leu Asp Arg Ile Arg Tyr Val 65 70 75 cag agt ctc aag gaa att gtc atc aac gtg cct gag cag tcg gct gtg 350 Gln Ser Leu Lys Glu Ile Val Ile Asn Val Pro Glu Gln Ser Ala Val 80 85 90 act ctc gac aat gta act ctg caa atc gat gga gtc ctt tac ctg cgc 398 Thr Leu Asp Asn Val Thr Leu Gln Ile Asp Gly Val Leu Tyr Leu Arg 95 100 105 atc atg gac cct tac aag gca agc tac ggt gtg gag gac cct gag tat 446 Ile Met Asp Pro Tyr Lys Ala Ser Tyr Gly Val Glu Asp Pro Glu Tyr 110 115 120 gcc gtc acc cag cta gct caa aca acc atg aga tca gag ctc ggc aaa 494 Ala Val Thr Gln Leu Ala Gln Thr Thr Met Arg Ser Glu Leu Gly Lys 125 130 135 140 ctc tct ctg gac aaa gtc ttc cgg gaa cgg gag tcc ctg aat gcc agc 542 Leu Ser Leu Asp Lys Val Phe Arg Glu Arg Glu Ser Leu Asn Ala Ser 145 150 155 att gtg gat gcc atc aac caa gct gct gac tgc tgg ggt atc cgc tgc 590 Ile Val Asp Ala Ile Asn Gln Ala Ala Asp Cys Trp Gly Ile Arg Cys 160 165 170 ctc cgt tat gag atc aag gat atc cat gtg cca ccc cgg gtg aaa gag 638 Leu Arg Tyr Glu Ile Lys Asp Ile His Val Pro Pro Arg Val Lys Glu 175 180 185 tct atg cag atg cag gtg gag gca gag cgg cgg aaa cgg gcc aca gtt 686 Ser Met Gln Met Gln Val Glu Ala Glu Arg Arg Lys Arg Ala Thr Val 190 195 200 cta gag tct gag ggg acc cga gag tcg gcc atc aat gtg gca gaa ggg 734 Leu Glu Ser Glu Gly Thr Arg Glu Ser Ala Ile Asn Val Ala Glu Gly 205 210 215 220 aag aaa cag gcc cag atc ctg gcc tcc gaa gca gaa aag gct gaa cag 782 Lys Lys Gln Ala Gln Ile Leu Ala Ser Glu Ala Glu Lys Ala Glu Gln 225 230 235 ata aat cag gca gca gga gag gcc agt gca gtt ctg gcg aag gcc aag 830 Ile Asn Gln Ala Ala Gly Glu Ala Ser Ala Val Leu Ala Lys Ala Lys 240 245 250 gct aaa gct gaa gct att cga atc ctg gct gca gct ctg aca caa cat 878 Ala Lys Ala Glu Ala Ile Arg Ile Leu Ala Ala Ala Leu Thr Gln His 255 260 265 aat gga gat gca gca gct tca ctg act gtg gcc gag cag tat gtc agc 926 Asn Gly Asp Ala Ala Ala Ser Leu Thr Val Ala Glu Gln Tyr Val Ser 270 275 280 gcg ttc tcc aaa ctg gcc aag gac tcc aac act atc cta ctg ccc tcc 974 Ala Phe Ser Lys Leu Ala Lys Asp Ser Asn Thr Ile Leu Leu Pro Ser 285 290 295 300 aac cct ggc gat gtc acc agc atg gtg gct cag gcc atg ggt gta tat 1022 Asn Pro Gly Asp Val Thr Ser Met Val Ala Gln Ala Met Gly Val Tyr 305 310 315 gga gcc ctc acc aaa gcc cca gtg cca ggg act cca gac tca ctc tcc 1070 Gly Ala Leu Thr Lys Ala Pro Val Pro Gly Thr Pro Asp Ser Leu Ser 320 325 330 agt ggg agc agc aga gat gtc cag ggt aca gat gca agt ctt gat gag 1118 Ser Gly Ser Ser Arg Asp Val Gln Gly Thr Asp Ala Ser Leu Asp Glu 335 340 345 gaa ctt gat cga gtc aag atg act tgg tcc cct gtc ccc aat ttt caa 1166 Glu Leu Asp Arg Val Lys Met Thr Trp Ser Pro Val Pro Asn Phe Gln 350 355 360 tta cta aat atc cca tca aac tgg ggc cag ccc cac gct cct ggc caa 1214 Leu Leu Asn Ile Pro Ser Asn Trp Gly Gln Pro His Ala Pro Gly Gln 365 370 375 380 act tcc acc gag gtg ccg gca gat ggc gat gga gcc acc gac ggg cca 1262 Thr Ser Thr Glu Val Pro Ala Asp Gly Asp Gly Ala Thr Asp Gly Pro 385 390 395 ctc tgt ctg gcg cac gca tct ctg tgc tgc cag gtc gca ggc gcc gcc 1310 Leu Cys Leu Ala His Ala Ser Leu Cys Cys Gln Val Ala Gly Ala Ala 400 405 410 gcc gcc gca ctt ccg ggt gcc att gca ggc ggc gcc gtc ggc tgg gcc 1358 Ala Ala Ala Leu Pro Gly Ala Ile Ala Gly Gly Ala Val Gly Trp Ala 415 420 425 cgg att ccc ctg cgg ctt cga tcc ctt tcc act ggg atg cag aaa gcc 1406 Arg Ile Pro Leu Arg Leu Arg Ser Leu Ser Thr Gly Met Gln Lys Ala 430 435 440 tca gtg ttg ctc ttc ctg gcc tgg gtc tgc ttc ctc ttc tac gct ggc 1454 Ser Val Leu Leu Phe Leu Ala Trp Val Cys Phe Leu Phe Tyr Ala Gly 445 450 455 460 att gcc ctc ttc acc agt ggc ttc ctg ctc acc cgt ttg gag ctc acc 1502 Ile Ala Leu Phe Thr Ser Gly Phe Leu Leu Thr Arg Leu Glu Leu Thr 465 470 475 aac cat agc agc tgc caa gag ccc cca ggc cct ggg tcc ctg cca tgg 1550 Asn His Ser Ser Cys Gln Glu Pro Pro Gly Pro Gly Ser Leu Pro Trp 480 485 490 ggg agc caa ggg aaa cct ggg gcc tgc tgg atg gct tcc cga ttt tcg 1598 Gly Ser Gln Gly Lys Pro Gly Ala Cys Trp Met Ala Ser Arg Phe Ser 495 500 505 cgg gtt gtg ttg gtg ctg ata gat gct ctg cga ttt gac ttc gcc cag 1646 Arg Val Val Leu Val Leu Ile Asp Ala Leu Arg Phe Asp Phe Ala Gln 510 515 520 ccc cag cat tca cac gtg cct aga gag cct cct gtc tcc cta ccc ttc 1694 Pro Gln His Ser His Val Pro Arg Glu Pro Pro Val Ser Leu Pro Phe 525 530 535 540 ctg ggc aaa cta agc tcc ttg cag agg atc ctg gag att cag ccc cac 1742 Leu Gly Lys Leu Ser Ser Leu Gln Arg Ile Leu Glu Ile Gln Pro His 545 550 555 cat gcc cgg ctc tac cga tct cag gtt gac cct cct acc acc acc atg 1790 His Ala Arg Leu Tyr Arg Ser Gln Val Asp Pro Pro Thr Thr Thr Met 560 565 570 cag cgc ctc aag gcc ctc acc act ggc tca ctg cct acc ttt att gat 1838 Gln Arg Leu Lys Ala Leu Thr Thr Gly Ser Leu Pro Thr Phe Ile Asp 575 580 585 gct ggt agt aac ttc gcc agc cac gcc ata gtg gaa gac aat ctc att 1886 Ala Gly Ser Asn Phe Ala Ser His Ala Ile Val Glu Asp Asn Leu Ile 590 595 600 aag cag ctc acc agt gca gga agg cgt gta gtc ttc atg gga gat gat 1934 Lys Gln Leu Thr Ser Ala Gly Arg Arg Val Val Phe Met Gly Asp Asp 605 610 615 620 acc tgg aaa gac ctt ttc cct ggt gct ttc tcc aaa gct ttc ttc ttc 1982 Thr Trp Lys Asp Leu Phe Pro Gly Ala Phe Ser Lys Ala Phe Phe Phe 625 630 635 cca tcc ttc aat gtc aga gac cta gac aca gtg gac aat ggc atc ctg 2030 Pro Ser Phe Asn Val Arg Asp Leu Asp Thr Val Asp Asn Gly Ile Leu 640 645 650 gaa cac ctc tac ccc acc atg gac agt ggt gaa tgg gac gtg ctg att 2078 Glu His Leu Tyr Pro Thr Met Asp Ser Gly Glu Trp Asp Val Leu Ile 655 660 665 gct cac ttc ctg ggt gtg gac cac tgt ggc cac aag cat ggc cct cac 2126 Ala His Phe Leu Gly Val Asp His Cys Gly His Lys His Gly Pro His 670 675 680 cac cct gaa atg gcc aag aaa ctt agc cag atg gac cag gtg atc cag 2174 His Pro Glu Met Ala Lys Lys Leu Ser Gln Met Asp Gln Val Ile Gln 685 690 695 700 gga ctt gtg gag cgt ctg gag aat gac aca ctg ctg gta gtg gct ggg 2222 Gly Leu Val Glu Arg Leu Glu Asn Asp Thr Leu Leu Val Val Ala Gly 705 710 715 gac cat ggg atg acc aca aat gga gac cat gga ggg gac agt gag ctg 2270 Asp His Gly Met Thr Thr Asn Gly Asp His Gly Gly Asp Ser Glu Leu 720 725 730 gag gtc tca gct gct ctc ttt ctg tat agc ccc aca gca gtc ttc ccc 2318 Glu Val Ser Ala Ala Leu Phe Leu Tyr Ser Pro Thr Ala Val Phe Pro 735 740 745 agc acc cca cca gag gag cca gag gtg att cct caa gtt agc ctt gtg 2366 Ser Thr Pro Pro Glu Glu Pro Glu Val Ile Pro Gln Val Ser Leu Val 750 755 760 ccc acg ctg gcc ctg ctg ctg ggc ctg ccc atc cca ttt ggg aat atc 2414 Pro Thr Leu Ala Leu Leu Leu Gly Leu Pro Ile Pro Phe Gly Asn Ile 765 770 775 780 ggg gaa gtg atg gct gag cta ttc tca ggg ggt gag gac tcc cag ccc 2462 Gly Glu Val Met Ala Glu Leu Phe Ser Gly Gly Glu Asp Ser Gln Pro 785 790 795 cac tcc tct gct tta gcc caa gcc tca gct ctc cat ctc aat gct cag 2510 His Ser Ser Ala Leu Ala Gln Ala Ser Ala Leu His Leu Asn Ala Gln 800 805 810 cag gtg tcc cga ttt ctt cat acc tac tca gct gct act cag gac ctt 2558 Gln Val Ser Arg Phe Leu His Thr Tyr Ser Ala Ala Thr Gln Asp Leu 815 820 825 caa gct aag gag ctt cat cag ctg cag aac ctc ttc tcc aag gcc tct 2606 Gln Ala Lys Glu Leu His Gln Leu Gln Asn Leu Phe Ser Lys Ala Ser 830 835 840 gct gac tac cag tgg ctt ctc cag agc ccc aag ggg gct gag gcg aca 2654 Ala Asp Tyr Gln Trp Leu Leu Gln Ser Pro Lys Gly Ala Glu Ala Thr 845 850 855 860 ctg ccg act gtg att gct gag ctg cag cag ttc ctg cgg gga gct cgg 2702 Leu Pro Thr Val Ile Ala Glu Leu Gln Gln Phe Leu Arg Gly Ala Arg 865 870 875 gcc atg tgc atc gag tct tgg gct cgt ttc tct ctg gtc cgc atg gcg 2750 Ala Met Cys Ile Glu Ser Trp Ala Arg Phe Ser Leu Val Arg Met Ala 880 885 890 ggg ggt act gct ctc ttg gct gct tcc tgc ttt atc tgc ctg ctg gca 2798 Gly Gly Thr Ala Leu Leu Ala Ala Ser Cys Phe Ile Cys Leu Leu Ala 895 900 905 tct cag tgg gca ata tcc cca ggc ttt cca ttc tgc cct cta ctc ctg 2846 Ser Gln Trp Ala Ile Ser Pro Gly Phe Pro Phe Cys Pro Leu Leu Leu 910 915 920 aca cct gtg gcc tgg ggc ctg gtt ggg gcc ata gcg tat gct gga ctc 2894 Thr Pro Val Ala Trp Gly Leu Val Gly Ala Ile Ala Tyr Ala Gly Leu 925 930 935 940 ctg gga act att gag ctg aag cta gat cta gtg ctt cta ggg gct gtg 2942 Leu Gly Thr Ile Glu Leu Lys Leu Asp Leu Val Leu Leu Gly Ala Val 945 950 955 gct gca gtg agc tca ttc ctc cct ttt ctg tgg aaa gcc tgg gct ggc 2990 Ala Ala Val Ser Ser Phe Leu Pro Phe Leu Trp Lys Ala Trp Ala Gly 960 965 970 tgg ggg tcc aag agg ccc ctg gca acc ctg ttt ccc atc cct ggg ccc 3038 Trp Gly Ser Lys Arg Pro Leu Ala Thr Leu Phe Pro Ile Pro Gly Pro 975 980 985 gtc ctg tta ctc ctg ctg ttt cgc ttg gct gtg ttc ttc tct gat agt 3086 Val Leu Leu Leu Leu Leu Phe Arg Leu Ala Val Phe Phe Ser Asp Ser 990 995 1000 ttt gtt gta gct gag gcc agg gcc acc ccc ttc ctt ttg ggc tca ttc 3134 Phe Val Val Ala Glu Ala Arg Ala Thr Pro Phe Leu Leu Gly Ser Phe 1005 1010 1015 1020 atc ctg ctc ctg gtt gtc cag ctt cac tgg gag ggc cag ctg ctt cca 3182 Ile Leu Leu Leu Val Val Gln Leu His Trp Glu Gly Gln Leu Leu Pro 1025 1030 1035 cct aag cta ctc aca atg ccc cgc ctt ggc act tca gcc aca aca aac 3230 Pro Lys Leu Leu Thr Met Pro Arg Leu Gly Thr Ser Ala Thr Thr Asn 1040 1045 1050 ccc cca cgg cac aat ggt gca tat gcc ctg agg ctt gga att ggg ttg 3278 Pro Pro Arg His Asn Gly Ala Tyr Ala Leu Arg Leu Gly Ile Gly Leu 1055 1060 1065 ctt tta tgt aca agg cta gct ggg ctt ttt cat cgt tgc cct gaa gag 3326 Leu Leu Cys Thr Arg Leu Ala Gly Leu Phe His Arg Cys Pro Glu Glu 1070 1075 1080 aca cct gtt tgc cac tcc tct ccc tgg ctg agt cct ctg gca tcc atg 3374 Thr Pro Val Cys His Ser Ser Pro Trp Leu Ser Pro Leu Ala Ser Met 1085 1090 1095 1100 gtg ggt ggt cga gcc aag aat ttg tgg tat gga gct tgt gtg gcg gcg 3422 Val Gly Gly Arg Ala Lys Asn Leu Trp Tyr Gly Ala Cys Val Ala Ala 1105 1110 1115 ctg gtg gcc ctg tta gct gcc gtg cgc ttg tgg ctt cgc cgc tat ggt 3470 Leu Val Ala Leu Leu Ala Ala Val Arg Leu Trp Leu Arg Arg Tyr Gly 1120 1125 1130 aat ctc aag agc ccc gag cca ccc atg ctc ttt gtg cgc tgg gga ctg 3518 Asn Leu Lys Ser Pro Glu Pro Pro Met Leu Phe Val Arg Trp Gly Leu 1135 1140 1145 ccc cta atg gca ttg ggt act gct gcc tac tgg gca ttg gcg tcg ggg 3566 Pro Leu Met Ala Leu Gly Thr Ala Ala Tyr Trp Ala Leu Ala Ser Gly 1150 1155 1160 gca gat gag gct ccc ccc cgt ctc cgg gtc ctg gtc tct ggg gca tcc 3614 Ala Asp Glu Ala Pro Pro Arg Leu Arg Val Leu Val Ser Gly Ala Ser 1165 1170 1175 1180 atg gtg ctg cct cgg gct gta gca ggg ctg gct gct tca ggg ctc gcg 3662 Met Val Leu Pro Arg Ala Val Ala Gly Leu Ala Ala Ser Gly Leu Ala 1185 1190 1195 ctg ctg ctc tgg aag cct gtg aca gtg ctg gtg aag gct ggg gca ggc 3710 Leu Leu Leu Trp Lys Pro Val Thr Val Leu Val Lys Ala Gly Ala Gly 1200 1205 1210 gct cca agg acc agg act gtc ctc act ccc ttc tca ggc ccc ccc act 3758 Ala Pro Arg Thr Arg Thr Val Leu Thr Pro Phe Ser Gly Pro Pro Thr 1215 1220 1225 tct caa gct gac ttg gat tat gtg gtc cct caa atc tac cga cac atg 3806 Ser Gln Ala Asp Leu Asp Tyr Val Val Pro Gln Ile Tyr Arg His Met 1230 1235 1240 cag gag gag ttc cgg ggc cgg tta gag agg acc aaa tct cag ggt ccc 3854 Gln Glu Glu Phe Arg Gly Arg Leu Glu Arg Thr Lys Ser Gln Gly Pro 1245 1250 1255 1260 ctg act gtg gct gct tat cag ttg ggg agt gtc tac tca gct gct atg 3902 Leu Thr Val Ala Ala Tyr Gln Leu Gly Ser Val Tyr Ser Ala Ala Met 1265 1270 1275 gtc aca gcc ctc acc ctg ttg gcc ttc cca ctt ctg ctg ttg cat gcg 3950 Val Thr Ala Leu Thr Leu Leu Ala Phe Pro Leu Leu Leu Leu His Ala 1280 1285 1290 gag cgc atc agc ctt gtg ttc ctg ctt ctg ttt ctg cag agc ttc ctt 3998 Glu Arg Ile Ser Leu Val Phe Leu Leu Leu Phe Leu Gln Ser Phe Leu 1295 1300 1305 ctc cta cat ctg ctt gct gct ggg ata ccc gtc acc acc cct ggt aaa 4046 Leu Leu His Leu Leu Ala Ala Gly Ile Pro Val Thr Thr Pro Gly Lys 1310 1315 1320 tat ctc agc tct gat tca ctt aaa gac aat agt gat agt caa ggg ctg 4094 Tyr Leu Ser Ser Asp Ser Leu Lys Asp Asn Ser Asp Ser Gln Gly Leu 1325 1330 1335 1340 cgg aag aga cag cag ccc cca ggg aat gaa gct gat gcc aga gtc aga 4142 Arg Lys Arg Gln Gln Pro Pro Gly Asn Glu Ala Asp Ala Arg Val Arg 1345 1350 1355 ccc gag gag gaa gag gag cca ctg atg gag atg cgg ctc cgg gat gcg 4190 Pro Glu Glu Glu Glu Glu Pro Leu Met Glu Met Arg Leu Arg Asp Ala 1360 1365 1370 cct cag cac ttc tat gca gca ctg ctg cag ctg ggc ctc aag tac ctc 4238 Pro Gln His Phe Tyr Ala Ala Leu Leu Gln Leu Gly Leu Lys Tyr Leu 1375 1380 1385 ttt atc ctt ggt att cag att ctg gcc tgt gcc ttg gca gcc tcc atc 4286 Phe Ile Leu Gly Ile Gln Ile Leu Ala Cys Ala Leu Ala Ala Ser Ile 1390 1395 1400 ctt cgc agg cat ctc atg gtc tgg aaa gtg ttt gcc cct aag ttc ata 4334 Leu Arg Arg His Leu Met Val Trp Lys Val Phe Ala Pro Lys Phe Ile 1405 1410 1415 1420 ttt gag gct gtg ggc ttc att gtg agc agc gtg gga ctt ctc ctg ggc 4382 Phe Glu Ala Val Gly Phe Ile Val Ser Ser Val Gly Leu Leu Leu Gly 1425 1430 1435 ata gct ttg gtg atg aga gtg gat ggt gct gta ctt ttg agc agt gca 4430 Ile Ala Leu Val Met Arg Val Asp Gly Ala Val Leu Leu Ser Ser Ala 1440 1445 1450 agt aca gag cgg cac tgc cag cag act aca cgc ggt aga aag ccg acc 4478 Ser Thr Glu Arg His Cys Gln Gln Thr Thr Arg Gly Arg Lys Pro Thr 1455 1460 1465 ttg gtg agc gtg ttg gtg ctc gac agt gag cag aga aag gat gga cga 4526 Leu Val Ser Val Leu Val Leu Asp Ser Glu Gln Arg Lys Asp Gly Arg 1470 1475 1480 tta cgg agc gcc ctc gtc tcc agt tac cgc ttt ctg gaa aca cca tcc 4574 Leu Arg Ser Ala Leu Val Ser Ser Tyr Arg Phe Leu Glu Thr Pro Ser 1485 1490 1495 1500 gcc ggg gcg gag ctg ttc cgc ccc gcc tcg gcc acc atg tcc cgc cag 4622 Ala Gly Ala Glu Leu Phe Arg Pro Ala Ser Ala Thr Met Ser Arg Gln 1505 1510 1515 acc acc tct gtg ggc tcc agc tgc ctg gac ctg tgg agg gaa aag aat 4670 Thr Thr Ser Val Gly Ser Ser Cys Leu Asp Leu Trp Arg Glu Lys Asn 1520 1525 1530 gac cgg ctc gtt cga cag gcc aag gtg gct cag aac tcc ggt ctg act 4718 Asp Arg Leu Val Arg Gln Ala Lys Val Ala Gln Asn Ser Gly Leu Thr 1535 1540 1545 ctg agg cga cag cag ttg gct cag gat gca ctg gaa ggg ctc aga ggg 4766 Leu Arg Arg Gln Gln Leu Ala Gln Asp Ala Leu Glu Gly Leu Arg Gly 1550 1555 1560 ctc ctc cat agt ctg caa ggg ctc cct gca gct gtt cct gtt ctt ccc 4814 Leu Leu His Ser Leu Gln Gly Leu Pro Ala Ala Val Pro Val Leu Pro 1565 1570 1575 1580 ttg gag ctg act gtc acc tgc aac ttc att atc ctg agg gca agc ttg 4862 Leu Glu Leu Thr Val Thr Cys Asn Phe Ile Ile Leu Arg Ala Ser Leu 1585 1590 1595 gcc cag ggt ttc aca gag gat cag gcc cag gat atc cag cgg agc cta 4910 Ala Gln Gly Phe Thr Glu Asp Gln Ala Gln Asp Ile Gln Arg Ser Leu 1600 1605 1610 gag aga gtg ctg gag aca cag gag cag cag ggg ccc agg ttg gaa cag 4958 Glu Arg Val Leu Glu Thr Gln Glu Gln Gln Gly Pro Arg Leu Glu Gln 1615 1620 1625 ggg ctc agg gag ctg tgg gac tct gtc ctt cgt gct tcc tgc ctt ctg 5006 Gly Leu Arg Glu Leu Trp Asp Ser Val Leu Arg Ala Ser Cys Leu Leu 1630 1635 1640 ccg gag ctg ctg tct gcc ctg cac cgc ctg gtt ggc ctg cag gct gcc 5054 Pro Glu Leu Leu Ser Ala Leu His Arg Leu Val Gly Leu Gln Ala Ala 1645 1650 1655 1660 ctc tgg ttg agt gct gac cgt ctt ggg gac ctg gcc ttg tta cta gag 5102 Leu Trp Leu Ser Ala Asp Arg Leu Gly Asp Leu Ala Leu Leu Leu Glu 1665 1670 1675 acc ctg aat ggc agc cag agt gga gcc tct aag gat ctg ctg tta ctt 5150 Thr Leu Asn Gly Ser Gln Ser Gly Ala Ser Lys Asp Leu Leu Leu Leu 1680 1685 1690 ctg aaa act tgg agt ccc cca gct gag gaa tta gat gct cca ttg acc 5198 Leu Lys Thr Trp Ser Pro Pro Ala Glu Glu Leu Asp Ala Pro Leu Thr 1695 1700 1705 ctg cag gat gcc cag gga ttg aag gat gtc ctc ctg aca gca ttt gcc 5246 Leu Gln Asp Ala Gln Gly Leu Lys Asp Val Leu Leu Thr Ala Phe Ala 1710 1715 1720 tac cgc caa ggt ctc cag gag ctg atc aca ggg aac cca gac aag gca 5294 Tyr Arg Gln Gly Leu Gln Glu Leu Ile Thr Gly Asn Pro Asp Lys Ala 1725 1730 1735 1740 cta agc agc ctt cat gaa gcg gcc tca ggc ctg tgt cca cgg cct gtg 5342 Leu Ser Ser Leu His Glu Ala Ala Ser Gly Leu Cys Pro Arg Pro Val 1745 1750 1755 ttg gtc cag gtg tac aca gca ctg ggg tcc tgt cac cgt aag atg gga 5390 Leu Val Gln Val Tyr Thr Ala Leu Gly Ser Cys His Arg Lys Met Gly 1760 1765 1770 aat cca cag aga gca ctg ttg tac ttg gtt gca gcc ctg aaa gag gga 5438 Asn Pro Gln Arg Ala Leu Leu Tyr Leu Val Ala Ala Leu Lys Glu Gly 1775 1780 1785 tca gcc tgg ggt cct cca ctt ctg gag gcc tct agg ctc tat cag caa 5486 Ser Ala Trp Gly Pro Pro Leu Leu Glu Ala Ser Arg Leu Tyr Gln Gln 1790 1795 1800 ctg ggg gac aca aca gca gag ctg gag agt ctg gag ctg cta gtt gag 5534 Leu Gly Asp Thr Thr Ala Glu Leu Glu Ser Leu Glu Leu Leu Val Glu 1805 1810 1815 1820 gcc ttg aat gtc cca tgc agt tcc aaa gcc ccg cag ttt ctc att gag 5582 Ala Leu Asn Val Pro Cys Ser Ser Lys Ala Pro Gln Phe Leu Ile Glu 1825 1830 1835 gta gaa tta cta ctg cca cca cct gac cta gcc tca ccc ctt cat tgt 5630 Val Glu Leu Leu Leu Pro Pro Pro Asp Leu Ala Ser Pro Leu His Cys 1840 1845 1850 ggc act cag agc cag acc aag cac ata cta gca agc agg tgc cta cag 5678 Gly Thr Gln Ser Gln Thr Lys His Ile Leu Ala Ser Arg Cys Leu Gln 1855 1860 1865 acg ggg agg gca gga gac gct gca gag cat tac ttg gac ctg ctg gcc 5726 Thr Gly Arg Ala Gly Asp Ala Ala Glu His Tyr Leu Asp Leu Leu Ala 1870 1875 1880 ctg ttg ctg gat agc tcg gag cca agg gtg ggg ccc tgt atg cct gag 5774 Leu Leu Leu Asp Ser Ser Glu Pro Arg Val Gly Pro Cys Met Pro Glu 1885 1890 1895 1900 gtg ttt ttg gag gca gcg gta gca ctg atc cag gca ggc aga gcc caa 5822 Val Phe Leu Glu Ala Ala Val Ala Leu Ile Gln Ala Gly Arg Ala Gln 1905 1910 1915 gat gcc ttg act cta tgt gag gag ttg ctc agc cgc aca tca tct ctg 5870 Asp Ala Leu Thr Leu Cys Glu Glu Leu Leu Ser Arg Thr Ser Ser Leu 1920 1925 1930 cta ccc aag atg tcc cgg ctg tgg gaa gat gcc aga aaa gga acc aag 5918 Leu Pro Lys Met Ser Arg Leu Trp Glu Asp Ala Arg Lys Gly Thr Lys 1935 1940 1945 gaa ctg cca tac tgc cca ctc tgg gtc tct gcc acc cac ctg ctt cag 5966 Glu Leu Pro Tyr Cys Pro Leu Trp Val Ser Ala Thr His Leu Leu Gln 1950 1955 1960 ggc cag gcc tgg gtt caa ctg ggt gcc caa aaa gtg gca att agt gaa 6014 Gly Gln Ala Trp Val Gln Leu Gly Ala Gln Lys Val Ala Ile Ser Glu 1965 1970 1975 1980 ttt agc agg tgc ctc gag ctg ctc ttc cgg gcc aca cct gag gaa aaa 6062 Phe Ser Arg Cys Leu Glu Leu Leu Phe Arg Ala Thr Pro Glu Glu Lys 1985 1990 1995 gaa caa ggg gca gct ttc aac tgt gag cag gga tgt aag tca gat gcg 6110 Glu Gln Gly Ala Ala Phe Asn Cys Glu Gln Gly Cys Lys Ser Asp Ala 2000 2005 2010 gca ctg cag cag ctt cgg gca gcc gcc cta att agt cgt gga ctg gaa 6158 Ala Leu Gln Gln Leu Arg Ala Ala Ala Leu Ile Ser Arg Gly Leu Glu 2015 2020 2025 tgg gta gcc agc ggc cag gat acc aaa gcc tta cag gac ttc ctc ctc 6206 Trp Val Ala Ser Gly Gln Asp Thr Lys Ala Leu Gln Asp Phe Leu Leu 2030 2035 2040 agt gtg cag atg tgc cca gtc tca gcg aag cgt ctg cga ccg tcg ttt 6254 Ser Val Gln Met Cys Pro Val Ser Ala Lys Arg Leu Arg Pro Ser Phe 2045 2050 2055 2060 gag tcg tcg ctg ccg ctg ccg ctg cca ctg cca ctg cca cct cgc gga 6302 Glu Ser Ser Leu Pro Leu Pro Leu Pro Leu Pro Leu Pro Pro Arg Gly 2065 2070 2075 tca gga gcc agc gtt gtt cgc ccg acg cct cgc tgc cgg ccc cgg ccc 6350 Ser Gly Ala Ser Val Val Arg Pro Thr Pro Arg Cys Arg Pro Arg Pro 2080 2085 2090 gcc cga ttg gct ccc tta gaa cgg acg tct ggg cct ggc cag gtc ttc 6398 Ala Arg Leu Ala Pro Leu Glu Arg Thr Ser Gly Pro Gly Gln Val Phe 2095 2100 2105 cgg ccc act ccg ccg ggg cgg cgc ccc ggg gct ttg ggg cgc cag tct 6446 Arg Pro Thr Pro Pro Gly Arg Arg Pro Gly Ala Leu Gly Arg Gln Ser 2110 2115 2120 gcc gtc cgg cct acc acc cgc cga aag cct ttg gtc ccc gga gag agc 6494 Ala Val Arg Pro Thr Thr Arg Arg Lys Pro Leu Val Pro Gly Glu Ser 2125 2130 2135 2140 agg ccc cgc gag ccc gag gcc cca gcc ggg ccc gag gag gac atc aaa 6542 Arg Pro Arg Glu Pro Glu Ala Pro Ala Gly Pro Glu Glu Asp Ile Lys 2145 2150 2155 gtt cag agg tta ggt aac ttg ccc aag atc aca att aag cag tgg cat 6590 Val Gln Arg Leu Gly Asn Leu Pro Lys Ile Thr Ile Lys Gln Trp His 2160 2165 2170 aat tgg aat tca gac cca atg ggt ctg act ata gag ttc ctg ctc tta 6638 Asn Trp Asn Ser Asp Pro Met Gly Leu Thr Ile Glu Phe Leu Leu Leu 2175 2180 2185 acc act ctt ctt tca aaa ggt gat gac cta tca aca gcc att ctc aaa 6686 Thr Thr Leu Leu Ser Lys Gly Asp Asp Leu Ser Thr Ala Ile Leu Lys 2190 2195 2200 cag aag aac cgt ccc aat cgg tta att gtt gat gaa gcc atc aat gag 6734 Gln Lys Asn Arg Pro Asn Arg Leu Ile Val Asp Glu Ala Ile Asn Glu 2205 2210 2215 2220 gac aac agt gtg gtg tcc ttg tcc cag ccc aag atg gat gaa ttg cag 6782 Asp Asn Ser Val Val Ser Leu Ser Gln Pro Lys Met Asp Glu Leu Gln 2225 2230 2235 ttg ttc cga ggt gac aca gtg ttg ctg aaa gga aag aag aga cga gaa 6830 Leu Phe Arg Gly Asp Thr Val Leu Leu Lys Gly Lys Lys Arg Arg Glu 2240 2245 2250 gct gtt tgc atc gtc ctt tct gat gat act tgt tct gat gag aag att 6878 Ala Val Cys Ile Val Leu Ser Asp Asp Thr Cys Ser Asp Glu Lys Ile 2255 2260 2265 cgg atg aat aga gtt gtt cgg aat aac ctt cgt gta cgc cta ggg gat 6926 Arg Met Asn Arg Val Val Arg Asn Asn Leu Arg Val Arg Leu Gly Asp 2270 2275 2280 gtc atc agc atc cag cca tgc cct gat gtg aag tac ggc aaa cgt atc 6974 Val Ile Ser Ile Gln Pro Cys Pro Asp Val Lys Tyr Gly Lys Arg Ile 2285 2290 2295 2300 cat gtg ctg ccc att gat gac aca gtg gaa ggc att act ggt aat ctc 7022 His Val Leu Pro Ile Asp Asp Thr Val Glu Gly Ile Thr Gly Asn Leu 2305 2310 2315 ttc gag gta tac ctt aag ccg tac ttc ctg gaa gcg tat cga ccc atc 7070 Phe Glu Val Tyr Leu Lys Pro Tyr Phe Leu Glu Ala Tyr Arg Pro Ile 2320 2325 2330 cgg aaa gga gac att ttt ctt gtc cgt ggt ggg atg cgt gct gtg gag 7118 Arg Lys Gly Asp Ile Phe Leu Val Arg Gly Gly Met Arg Ala Val Glu 2335 2340 2345 ttc aaa gtg gtg gaa aca gat cct agc cct tat tgc att gtt gct cca 7166 Phe Lys Val Val Glu Thr Asp Pro Ser Pro Tyr Cys Ile Val Ala Pro 2350 2355 2360 gac aca gtg atc cac tgc gaa ggg gag cct atc aaa cga gag gat gag 7214 Asp Thr Val Ile His Cys Glu Gly Glu Pro Ile Lys Arg Glu Asp Glu 2365 2370 2375 2380 gaa gag tcc ttg aat gaa gta ggg tat gat gac att ggt ggc tgc agg 7262 Glu Glu Ser Leu Asn Glu Val Gly Tyr Asp Asp Ile Gly Gly Cys Arg 2385 2390 2395 aag cag cta gct cag ata aag gag atg gtg gaa ctg ccc ctg aga cat 7310 Lys Gln Leu Ala Gln Ile Lys Glu Met Val Glu Leu Pro Leu Arg His 2400 2405 2410 cct gcc ctc ttt aag gca att ggt gtg aag cct cct aga gga atc ctg 7358 Pro Ala Leu Phe Lys Ala Ile Gly Val Lys Pro Pro Arg Gly Ile Leu 2415 2420 2425 ctt tac gga cct cct gga aca gga aag acc ctg att gct cga gct gta 7406 Leu Tyr Gly Pro Pro Gly Thr Gly Lys Thr Leu Ile Ala Arg Ala Val 2430 2435 2440 gca aat gag act gga gcc ttc ttc ttc ttg atc aat ggt cct gag atc 7454 Ala Asn Glu Thr Gly Ala Phe Phe Phe Leu Ile Asn Gly Pro Glu Ile 2445 2450 2455 2460 atg agc aaa ttg gct ggt gag tct gag agc aac ctt cgt aaa gcc ttt 7502 Met Ser Lys Leu Ala Gly Glu Ser Glu Ser Asn Leu Arg Lys Ala Phe 2465 2470 2475 gag gag gct gag aag aat gct cct gcc atc atc ttc att gat gag cta 7550 Glu Glu Ala Glu Lys Asn Ala Pro Ala Ile Ile Phe Ile Asp Glu Leu 2480 2485 2490 gat gcc atc gct ccc aaa aga gag aaa act cat ggc gag gtg gag cgg 7598 Asp Ala Ile Ala Pro Lys Arg Glu Lys Thr His Gly Glu Val Glu Arg 2495 2500 2505 cgc att gta tca cag ttg ttg acc ctc atg gat ggc cta aag cag agg 7646 Arg Ile Val Ser Gln Leu Leu Thr Leu Met Asp Gly Leu Lys Gln Arg 2510 2515 2520 gca cat gtg att gtt atg gca gca acc aac aga ccc aac agc att gac 7694 Ala His Val Ile Val Met Ala Ala Thr Asn Arg Pro Asn Ser Ile Asp 2525 2530 2535 2540 cca gct cta cgg cga ttt ggt cgc ttt gac agg gag gta gat att gga 7742 Pro Ala Leu Arg Arg Phe Gly Arg Phe Asp Arg Glu Val Asp Ile Gly 2545 2550 2555 att cct gat gct aca gga cgc tta gag att ctt cag atc cat acc aag 7790 Ile Pro Asp Ala Thr Gly Arg Leu Glu Ile Leu Gln Ile His Thr Lys 2560 2565 2570 aac atg aag ctg gca gat gat gtg gac ctg gaa cag gta gcc aat gag 7838 Asn Met Lys Leu Ala Asp Asp Val Asp Leu Glu Gln Val Ala Asn Glu 2575 2580 2585 act cac ggg cat gtg ggt gct gac tta gca gcc ctg tgc tca gag gct 7886 Thr His Gly His Val Gly Ala Asp Leu Ala Ala Leu Cys Ser Glu Ala 2590 2595 2600 gct ctg caa gcc atc cgc aag aag atg gat ctc att gac cta gag gat 7934 Ala Leu Gln Ala Ile Arg Lys Lys Met Asp Leu Ile Asp Leu Glu Asp 2605 2610 2615 2620 gag acc att gat gcc gag gtc atg aac tct cta gca gtt act atg gat 7982 Glu Thr Ile Asp Ala Glu Val Met Asn Ser Leu Ala Val Thr Met Asp 2625 2630 2635 gac ttc cgg gta agg acc aca ccc gtg cct cag tgg gcc ttg agc cag 8030 Asp Phe Arg Val Arg Thr Thr Pro Val Pro Gln Trp Ala Leu Ser Gln 2640 2645 2650 agt aac cca tca gca ctg cgg gaa acc gtg gta gag gtg cca cag gta 8078 Ser Asn Pro Ser Ala Leu Arg Glu Thr Val Val Glu Val Pro Gln Val 2655 2660 2665 acc tgg gaa gac atc ggg ggc cta gag gat gtc aaa cgt gag cta cag 8126 Thr Trp Glu Asp Ile Gly Gly Leu Glu Asp Val Lys Arg Glu Leu Gln 2670 2675 2680 gag ctg gtc cag tat cct gtg gag cac cca gac aaa ttc ctg aag ttt 8174 Glu Leu Val Gln Tyr Pro Val Glu His Pro Asp Lys Phe Leu Lys Phe 2685 2690 2695 2700 ggc atg aca cct tcc aag gga gtt ctg ttc tat gga cct cct ggc tgt 8222 Gly Met Thr Pro Ser Lys Gly Val Leu Phe Tyr Gly Pro Pro Gly Cys 2705 2710 2715 ggg aaa act ttg ttg gcc aaa gcc att gct aat gaa tgc cag gcc aac 8270 Gly Lys Thr Leu Leu Ala Lys Ala Ile Ala Asn Glu Cys Gln Ala Asn 2720 2725 2730 ttc atc tcc atc aag ggt cct gag ctg ctc acc atg tgg ttt ggg gag 8318 Phe Ile Ser Ile Lys Gly Pro Glu Leu Leu Thr Met Trp Phe Gly Glu 2735 2740 2745 tct gag gcc aat gtc aga gaa atc ttt gac aag gcc cgc caa gct gcc 8366 Ser Glu Ala Asn Val Arg Glu Ile Phe Asp Lys Ala Arg Gln Ala Ala 2750 2755 2760 ccc tgt gtg cta ttc ttt gat gag ctg gat tcg att gcc aag gct cgt 8414 Pro Cys Val Leu Phe Phe Asp Glu Leu Asp Ser Ile Ala Lys Ala Arg 2765 2770 2775 2780 gga ggt aac att gga gat ggt ggt ggg gct gct gac cga gtc atc aac 8462 Gly Gly Asn Ile Gly Asp Gly Gly Gly Ala Ala Asp Arg Val Ile Asn 2785 2790 2795 cag atc ctg aca gaa atg gat ggc atg tcc aca aaa aaa aat gtg ttc 8510 Gln Ile Leu Thr Glu Met Asp Gly Met Ser Thr Lys Lys Asn Val Phe 2800 2805 2810 atc att ggc gct acc aac cgg cct gac atc att gat cct gcc atc ctc 8558 Ile Ile Gly Ala Thr Asn Arg Pro Asp Ile Ile Asp Pro Ala Ile Leu 2815 2820 2825 aga cct ggc cgt ctt gat cag ctc atc tac atc cca ctt cct gat gag 8606 Arg Pro Gly Arg Leu Asp Gln Leu Ile Tyr Ile Pro Leu Pro Asp Glu 2830 2835 2840 aag tcc cgt gtt gcc atc ctc aag gct aac ctg cgc aag tcc cca gtt 8654 Lys Ser Arg Val Ala Ile Leu Lys Ala Asn Leu Arg Lys Ser Pro Val 2845 2850 2855 2860 gcc aag gca ggt gca aga tca tgg gct gat gtg gac ttg gag ttc ctg 8702 Ala Lys Ala Gly Ala Arg Ser Trp Ala Asp Val Asp Leu Glu Phe Leu 2865 2870 2875 gct aaa atg act aat ggc ttc tct gga gct gac ctg aca gag att tgc 8750 Ala Lys Met Thr Asn Gly Phe Ser Gly Ala Asp Leu Thr Glu Ile Cys 2880 2885 2890 cag cgt gct tgc aag ctg gcc atc cgt gaa tcc atc gag agt gag att 8798 Gln Arg Ala Cys Lys Leu Ala Ile Arg Glu Ser Ile Glu Ser Glu Ile 2895 2900 2905 agg cga gaa cga gag agg cag aca aac cca tca gcc atg gag gta gaa 8846 Arg Arg Glu Arg Glu Arg Gln Thr Asn Pro Ser Ala Met Glu Val Glu 2910 2915 2920 gag gat gat cca gtg cct gag atc cgt cga gat cac ttt gaa gaa gcc 8894 Glu Asp Asp Pro Val Pro Glu Ile Arg Arg Asp His Phe Glu Glu Ala 2925 2930 2935 2940 atg cgc ttt gcg cgc cgt tct gtc agt gac aat gac att cgg aag tat 8942 Met Arg Phe Ala Arg Arg Ser Val Ser Asp Asn Asp Ile Arg Lys Tyr 2945 2950 2955 gag atg ttt gcc cag acc ctt cag cag agt cgg ggc ttt ggc agc ttc 8990 Glu Met Phe Ala Gln Thr Leu Gln Gln Ser Arg Gly Phe Gly Ser Phe 2960 2965 2970 aga ttc cct tca ggg aac cag ggt gga gct ggc ccc agt cag ggc agt 9038 Arg Phe Pro Ser Gly Asn Gln Gly Gly Ala Gly Pro Ser Gln Gly Ser 2975 2980 2985 gga ggc ggc aca ggt ggc agt gta tac aca gaa gac aat gat gat gac 9086 Gly Gly Gly Thr Gly Gly Ser Val Tyr Thr Glu Asp Asn Asp Asp Asp 2990 2995 3000 ctg tat ggc taa 9098 Leu Tyr Gly * 3005 38 1268 DNA Homo sapiens CDS (562)..(1026) 38 agaggattag aacctagtat gccgtacgcc agcggtaccg gtccggaatt cccgggtcga 60 cccacgcgtc cgggcgacag ggagggagga agcctaggag tccgccgcgg gacggaggcc 120 tgggggaact gggagttcag ctttctgcag agggccacta ggaacctcgg attgcccacg 180 gaagccagcc actttctttg acagtccagc ccacctcctc ttctgcccgg agaagctcca 240 ggggctgcct ttgtgatcac agcatcttca caaggaccaa aggaaaataa gatttctcgt 300 aagaacaccg tgaccacatc tttaaaatga cccatttcgt ggctcccaca agatttacac 360 ctccacactg aggccggaag tggttttgcc cctataaaac atggcgaaaa gctttcttgt 420 ctccaaggaa acgccacgta atgagtcaaa gctgtggcgc acgcgcagaa gtacaagcta 480 ccggaagtga tggcgcccct actaaagcct tggggttagt acgcgtgcgc agcagtctct 540 tccgacagtt gtgttgtgcc a atg gtg gag aag aaa act tcg gtt cgc tcc 591 Met Val Glu Lys Lys Thr Ser Val Arg Ser 1 5 10 cag gac ccc ggg cag cgg cgg gtg ctg gac cgg gct gcc cgg cag cgt 639 Gln Asp Pro Gly Gln Arg Arg Val Leu Asp Arg Ala Ala Arg Gln Arg 15 20 25 cgc atc aac cgg cag ctg gag gcc ctg gag aat gac aac ttc cag gat 687 Arg Ile Asn Arg Gln Leu Glu Ala Leu Glu Asn Asp Asn Phe Gln Asp 30 35 40 gac ccc cac gcg gga ctc cct cag ctc ggc aag aga ctg cct cag ttt 735 Asp Pro His Ala Gly Leu Pro Gln Leu Gly Lys Arg Leu Pro Gln Phe 45 50 55 gat gac gat gcg gac act gga aag aaa aag aag aaa acc cga ggt gat 783 Asp Asp Asp Ala Asp Thr Gly Lys Lys Lys Lys Lys Thr Arg Gly Asp 60 65 70 cat ttt aaa ctt cgc ttc cga aaa aac ttt cag gcc ctg ttg gag gag 831 His Phe Lys Leu Arg Phe Arg Lys Asn Phe Gln Ala Leu Leu Glu Glu 75 80 85 90 cag aac ttg agt gtg gcc gag ggc cct aac tac ctg acg gcc tgt gcg 879 Gln Asn Leu Ser Val Ala Glu Gly Pro Asn Tyr Leu Thr Ala Cys Ala 95 100 105 gga ccc cca tcg cgg ccc cag cgc ccc ttc tgt gct gtc tgt ggc ttc 927 Gly Pro Pro Ser Arg Pro Gln Arg Pro Phe Cys Ala Val Cys Gly Phe 110 115 120 cca tcc ccc tac acc tgt gtc agc tgc ggt gcc cgg tac tgc act gtg 975 Pro Ser Pro Tyr Thr Cys Val Ser Cys Gly Ala Arg Tyr Cys Thr Val 125 130 135 cgc tgt ctg ggg acc cac cag gag acc agg tgt ctg aag tgg act gtg 1023 Arg Cys Leu Gly Thr His Gln Glu Thr Arg Cys Leu Lys Trp Thr Val 140 145 150 tga gcct gggcattccc agagaggaag ggccgctgtg cactgcccgg ccttcagaaa 1080 * 155 gacagaattt catcacccaa tgcaggggga gctcttcctg gaccaaggga ggagccgctc 1140 attcacccaa caaaactgtg tcttatctgc caggaaagac cagcctcact cctgggaact 1200 gtctggcagg taggctgggc cccccagtgc tgttagaata aaaagcctcg tgccggaaaa 1260 aaaaaaaa 1268 39 1460 DNA Homo sapiens CDS (230)..(814) 39 cgacccacgc gtccgggagt ccacacaggc agttggaaca actggaaagc gggaggaaat 60 aatggaaagt ctgctgacct tctcgtctta caaccaccat ataattcatg gatgacagac 120 tttgagaagg ttcctgccct caggggcttg gtctcctctt ctgcaaggat caatacaaat 180 tggcgacatt ttttccaaac tacttgacct tgaccagata gtcttcaag atg cca 235 Met Pro 1 aac tgg ggc gga ggc gca aaa tgt gga gcc tgt gaa aag acc gtc tac 283 Asn Trp Gly Gly Gly Ala Lys Cys Gly Ala Cys Glu Lys Thr Val Tyr 5 10 15 cat gca gaa gaa atc cag tgc aat gga agg agt ttc cac aag acg tgt 331 His Ala Glu Glu Ile Gln Cys Asn Gly Arg Ser Phe His Lys Thr Cys 20 25 30 ttc cac tgc atg gcc tgc agg aag gct ctt gac agc acg aca gtc gcg 379 Phe His Cys Met Ala Cys Arg Lys Ala Leu Asp Ser Thr Thr Val Ala 35 40 45 50 gct cat gag tcg gag atc tac tgc aag gtg tgc tat ggg cgc aga tat 427 Ala His Glu Ser Glu Ile Tyr Cys Lys Val Cys Tyr Gly Arg Arg Tyr 55 60 65 ggc ccc aaa ggg atc ggg tat gga caa ggc gct ggc tgt ctc agc aca 475 Gly Pro Lys Gly Ile Gly Tyr Gly Gln Gly Ala Gly Cys Leu Ser Thr 70 75 80 gac acg ggc gag cat ctc ggc ctg cag ttc caa cag tcc cca aag ccg 523 Asp Thr Gly Glu His Leu Gly Leu Gln Phe Gln Gln Ser Pro Lys Pro 85 90 95 gca cgc tca gtt acc acc agc aac cct tcc aaa ttc act gcg aag ttt 571 Ala Arg Ser Val Thr Thr Ser Asn Pro Ser Lys Phe Thr Ala Lys Phe 100 105 110 gga gag tcc gag aag tgc cct cga tgt ggc aag tca gtc tat gct gct 619 Gly Glu Ser Glu Lys Cys Pro Arg Cys Gly Lys Ser Val Tyr Ala Ala 115 120 125 130 gag aag gtt atg gga ggt ggc aag cct tgg cac aag acc tgt ttc cgc 667 Glu Lys Val Met Gly Gly Gly Lys Pro Trp His Lys Thr Cys Phe Arg 135 140 145 tgt gcc atc tgt ggg aag agt ctg gag tcc aca aat gtc act gac aaa 715 Cys Ala Ile Cys Gly Lys Ser Leu Glu Ser Thr Asn Val Thr Asp Lys 150 155 160 gat ggg gaa ctt tat tgc aaa gtt tgc tat gcc aaa aat ttt ggc ccc 763 Asp Gly Glu Leu Tyr Cys Lys Val Cys Tyr Ala Lys Asn Phe Gly Pro 165 170 175 acg ggt att ggg ttt gga ggc ctt aca caa caa gtg gaa aag aaa gaa 811 Thr Gly Ile Gly Phe Gly Gly Leu Thr Gln Gln Val Glu Lys Lys Glu 180 185 190 tga agag gtgcgccgtt tctcagattt tttgcgagcc taaaacactt gccaagtaat 868 * 195 cctgcacaga tcgatacctt tccccaaata gcctctcctt tgtagtcgta cattatgtgt 928 ttctcctcag aagtgatcag gtctttactg aatgttagaa gaggcctttg gaagaaaatt 988 atgtaaagtt taatctataa caaatgcttt attatttata atgcttggaa tgggagaggc 1048 aataaataaa tgttttagtg ctatcttgta tggctctaga tcttttcttt gagatagaaa 1108 ttttcaaaaa cataaagcta gttcaaaaaa cgagttgcag agcatataat aaatttggat 1168 gtcaactgag aaaggagtga gaaggaagaa acaatgcgca aaggaaagca gtctttcaga 1228 atctgtcagc caagtgtctt tctagttact gctaatggag aagaaaacag ggggtctggg 1288 agaaaataga gaacatgata gcaaaatcta aaaggaaaat caaaactaat aaaattgctg 1348 aagagttgat ccctttgtcc tatcgtgggg ctttgtaatg ttacacatct cgtgaaaact 1408 cagaaatgac aataaagcgt ggcatttgcc tctgtattat aaaaaaaaaa aa 1460 40 1208 DNA Homo sapiens CDS (341)..(1006) 40 ggaattcccg ggtcgacgat ttcgtgtggc tcagtcttgg ctcattgaaa ccttcacctc 60 ctgggttcaa gcgattctcc tgcctcagcc tcctgagtag ctgggattac aaccgtatgc 120 cacaacgtcc ggctaatttt gtaattttag tagagacagg gtttcaccat gttggccagg 180 ctggtctcaa actcctgacc tcaggtgatc cgctcacctc ggcctcccaa agtgctggga 240 ttacaggtgt gagccagcgc ctggcttagg ggccatcttt gagtcggccc tggctctgct 300 ccccaggtct cagtgctgtg ccccccccag agcctagagg atg ttt cat ggg atc 355 Met Phe His Gly Ile 1 5 cca gcc acg ccg ggc ata gga gcc cct ggg aac aag ccg gag ctg tat 403 Pro Ala Thr Pro Gly Ile Gly Ala Pro Gly Asn Lys Pro Glu Leu Tyr 10 15 20 gag gaa gtg aag ttg tac aag aac gcc cgg gag agg gag aag tac gac 451 Glu Glu Val Lys Leu Tyr Lys Asn Ala Arg Glu Arg Glu Lys Tyr Asp 25 30 35 aac atg gca gag ctg ttt gcg gtg gtg aag aca atg caa gcc ctg gag 499 Asn Met Ala Glu Leu Phe Ala Val Val Lys Thr Met Gln Ala Leu Glu 40 45 50 aag gcc tac atc aag gac tgt gtc tcc ccc agc gag tac act gca gcc 547 Lys Ala Tyr Ile Lys Asp Cys Val Ser Pro Ser Glu Tyr Thr Ala Ala 55 60 65 tgc tcc cgg ctc ctg gtc caa tac aaa gct gcc ttc agg cag gtc cag 595 Cys Ser Arg Leu Leu Val Gln Tyr Lys Ala Ala Phe Arg Gln Val Gln 70 75 80 85 ggc tca gaa atc agc tct att gac gaa ttc tgc cgc aag ttc cgc ctg 643 Gly Ser Glu Ile Ser Ser Ile Asp Glu Phe Cys Arg Lys Phe Arg Leu 90 95 100 gac tgc ccg ctg gcc atg gag cgg atc aag gag gac cgg ccc atc acc 691 Asp Cys Pro Leu Ala Met Glu Arg Ile Lys Glu Asp Arg Pro Ile Thr 105 110 115 atc aag gac gac aag ggc aac ctc aac cgc tgc atc gca gac gtg gtc 739 Ile Lys Asp Asp Lys Gly Asn Leu Asn Arg Cys Ile Ala Asp Val Val 120 125 130 tcg ctc ttc atc acg gtc atg gac aag ctg cgc ctg gag atc cgc gcc 787 Ser Leu Phe Ile Thr Val Met Asp Lys Leu Arg Leu Glu Ile Arg Ala 135 140 145 atg gat gag atc cag ccc gac ctg cga gag ctg atg gag acc atg cac 835 Met Asp Glu Ile Gln Pro Asp Leu Arg Glu Leu Met Glu Thr Met His 150 155 160 165 cgc atg agc cac ctc cca ccc gac ttt gag ggc cgc cag acg gtc agc 883 Arg Met Ser His Leu Pro Pro Asp Phe Glu Gly Arg Gln Thr Val Ser 170 175 180 cag tgg ctg cag acc ctg agc ggc atg tcg gcg tca gat gag ctg gac 931 Gln Trp Leu Gln Thr Leu Ser Gly Met Ser Ala Ser Asp Glu Leu Asp 185 190 195 gac tca cag gtg cgt cag atg ctg ttc gac ctg gag tca gcc tac aac 979 Asp Ser Gln Val Arg Gln Met Leu Phe Asp Leu Glu Ser Ala Tyr Asn 200 205 210 gcc ttc aac cgc ttc ctg cat gcc tga gcccg gggcactagc ccttgcacag 1031 Ala Phe Asn Arg Phe Leu His Ala * 215 220 aagggcagag tctgaggcga tggctcctgg tcccctgtcc gccacacagg ccgtggtcat 1091 ccacacaact cactgtctgc agctgcctgt ctggtgtctg tctttggtgt cagaactttg 1151 ggggccgggc ccctccccac aataaagatg ctctccgacc ttcctaaaaa aaaaaaa 1208 41 1097 DNA Homo sapiens CDS (175)..(858) misc_feature (1)...(1097) n = a,t,c or g 41 atttggccct cgaggccaag aattcggcac gagtcaaaga cagagacacc aagaagaatc 60 ggaacataca ggctttgata tcaaaggttt ataaagccaa tatctgggaa agagaaaacc 120 gtgagacttc cagatcttct ctggtgaagt gtgtttcctg caacgatcac gaac atg 177 Met 1 aac atc aaa gga tcg cca tgg aaa ggg tcc ctc ctg ctg ctg ctg gtg 225 Asn Ile Lys Gly Ser Pro Trp Lys Gly Ser Leu Leu Leu Leu Leu Val 5 10 15 tca aac ctg ctc ctg tgc cag agc gtg gcc ccc ttg ccc atc tgt ccc 273 Ser Asn Leu Leu Leu Cys Gln Ser Val Ala Pro Leu Pro Ile Cys Pro 20 25 30 ggc ggg gct gcc cga tgc cag gtg acc ctt cga gac ctg ttt gac cgc 321 Gly Gly Ala Ala Arg Cys Gln Val Thr Leu Arg Asp Leu Phe Asp Arg 35 40 45 gcc gtc gtc ctg tcc cac tac atc cat aac ctc tcc tca gaa atg ttc 369 Ala Val Val Leu Ser His Tyr Ile His Asn Leu Ser Ser Glu Met Phe 50 55 60 65 agc gaa ttc gat aaa cgg tat acc cat ggc cgg ggg ttc att acc aag 417 Ser Glu Phe Asp Lys Arg Tyr Thr His Gly Arg Gly Phe Ile Thr Lys 70 75 80 gcc atc aac agc tgc cac act tct tcc ctt gcc acc ccc gaa gac aag 465 Ala Ile Asn Ser Cys His Thr Ser Ser Leu Ala Thr Pro Glu Asp Lys 85 90 95 gag caa gcc caa cag atg aat caa aaa gac ttt ctg agc ctg ata gtc 513 Glu Gln Ala Gln Gln Met Asn Gln Lys Asp Phe Leu Ser Leu Ile Val 100 105 110 agc ata ttg cga tcc tgg aat gag cct ctg tat cat ctg gtc acg gaa 561 Ser Ile Leu Arg Ser Trp Asn Glu Pro Leu Tyr His Leu Val Thr Glu 115 120 125 gta cgt ggt atg caa gaa gcc ccc gag gct atc cta tcc aaa gct gta 609 Val Arg Gly Met Gln Glu Ala Pro Glu Ala Ile Leu Ser Lys Ala Val 130 135 140 145 gag att gag gag caa acc aaa cgg ctt cta gag ggc atg gag ctg ata 657 Glu Ile Glu Glu Gln Thr Lys Arg Leu Leu Glu Gly Met Glu Leu Ile 150 155 160 gtc agc cag gtt cat cct gaa acc aaa gaa aat gag atc tac cct gtc 705 Val Ser Gln Val His Pro Glu Thr Lys Glu Asn Glu Ile Tyr Pro Val 165 170 175 tgg tcg gga ctt cca tcc ctg cag atg gct gat gaa gag tct cgc ctt 753 Trp Ser Gly Leu Pro Ser Leu Gln Met Ala Asp Glu Glu Ser Arg Leu 180 185 190 tct gct tat tat aac ctg ctc cac tgc cta cgc agg gat tca cat aaa 801 Ser Ala Tyr Tyr Asn Leu Leu His Cys Leu Arg Arg Asp Ser His Lys 195 200 205 atc gac aat tat ctc aag ctc ctg aag tgc cga atc atc cac aac aac 849 Ile Asp Asn Tyr Leu Lys Leu Leu Lys Cys Arg Ile Ile His Asn Asn 210 215 220 225 aac tgc taa gcccaca tccatttcat ctatttctga gaaggtcctt aatgatccgt 905 Asn Cys * tccattgcaa gcttctttta gttgtatctc ttttgaatcc atgcttgggt gtaacaggtc 965 tcctcttaaa aaataaaaac tgactcctta gagacatcaa aaaaagactt cacttaatgg 1025 gccgggcgca gtggctcatg nctgtggtcc cggcactttg ggaggccgag gcaggcggat 1085 caggaggtca gg 1097 42 1697 DNA Homo sapiens CDS (25)..(1581) 42 agcggccgct atcgggaaga agaa atg act gtg gtg gag gaa gcg gat gat 51 Met Thr Val Val Glu Glu Ala Asp Asp 1 5 gac aaa aaa agg ctg ctg cag att att gac aga gat ggg gaa gag gaa 99 Asp Lys Lys Arg Leu Leu Gln Ile Ile Asp Arg Asp Gly Glu Glu Glu 10 15 20 25 gag gaa gag gag gag cca ttg gat gaa agc tca gtg aag aaa atg atc 147 Glu Glu Glu Glu Glu Pro Leu Asp Glu Ser Ser Val Lys Lys Met Ile 30 35 40 ctc aca ttt gaa aag aga tca tat aaa aac caa gaa ttg cgg att aag 195 Leu Thr Phe Glu Lys Arg Ser Tyr Lys Asn Gln Glu Leu Arg Ile Lys 45 50 55 ttt cca gac aat cca gag aag ttc atg gaa tcc gag ctg gac cta aat 243 Phe Pro Asp Asn Pro Glu Lys Phe Met Glu Ser Glu Leu Asp Leu Asn 60 65 70 gac atc att cag gag atg cac gtg gtg gcc acc atg cca gac ctg tac 291 Asp Ile Ile Gln Glu Met His Val Val Ala Thr Met Pro Asp Leu Tyr 75 80 85 cac ctt ctg gtg gag ctg aat gct gta cag tcg ctt ctc ggc ttg ctc 339 His Leu Leu Val Glu Leu Asn Ala Val Gln Ser Leu Leu Gly Leu Leu 90 95 100 105 gga cac gat aat aca gat gtg tcc ata gct gtg gtc gat ttg ctt cag 387 Gly His Asp Asn Thr Asp Val Ser Ile Ala Val Val Asp Leu Leu Gln 110 115 120 gaa tta aca gat ata gac acc ctc cat gag agt gaa gag gga gca gaa 435 Glu Leu Thr Asp Ile Asp Thr Leu His Glu Ser Glu Glu Gly Ala Glu 125 130 135 gtg ctc atc gat gct ctg gtg gat ggg cag gtg gta gca ctg ctg gta 483 Val Leu Ile Asp Ala Leu Val Asp Gly Gln Val Val Ala Leu Leu Val 140 145 150 cag aat ctg gag cgc ctg gat gag tct gtg aaa gag gag gca gat ggc 531 Gln Asn Leu Glu Arg Leu Asp Glu Ser Val Lys Glu Glu Ala Asp Gly 155 160 165 gtc cac aac act ctg gct att gtg gaa aac atg gct gag ttc cgg cct 579 Val His Asn Thr Leu Ala Ile Val Glu Asn Met Ala Glu Phe Arg Pro 170 175 180 185 gag atg tgt aca gag ggt gcc cag cag ggt ctt cta cag tgg ctg ttg 627 Glu Met Cys Thr Glu Gly Ala Gln Gln Gly Leu Leu Gln Trp Leu Leu 190 195 200 aag agg ctg aag gca aag atg cct ttt gat gcc aac aaa ctg tat tgc 675 Lys Arg Leu Lys Ala Lys Met Pro Phe Asp Ala Asn Lys Leu Tyr Cys 205 210 215 agt gaa gtg ctg gcc ata ttg ctc cag gac aat gat gaa aac agg gaa 723 Ser Glu Val Leu Ala Ile Leu Leu Gln Asp Asn Asp Glu Asn Arg Glu 220 225 230 ttg ctt ggg gag ctg gat gga atc gat gtg ctt ctt cag cag tta tcc 771 Leu Leu Gly Glu Leu Asp Gly Ile Asp Val Leu Leu Gln Gln Leu Ser 235 240 245 gtg ttt aaa aga cac aat ccc agc acg gct gag gag cag gag atg atg 819 Val Phe Lys Arg His Asn Pro Ser Thr Ala Glu Glu Gln Glu Met Met 250 255 260 265 gag aat ctg ttt gat tcc ctc tgc tcc tgt cta atg ctt agt tcc aat 867 Glu Asn Leu Phe Asp Ser Leu Cys Ser Cys Leu Met Leu Ser Ser Asn 270 275 280 cgt gag cgc ttc ctg aag ggc gag ggt ctt cag ctg atg aat ctc atg 915 Arg Glu Arg Phe Leu Lys Gly Glu Gly Leu Gln Leu Met Asn Leu Met 285 290 295 ctc agg gaa aag aag atc tcc cgg agc agt gcc ctg aaa gtg ctg gac 963 Leu Arg Glu Lys Lys Ile Ser Arg Ser Ser Ala Leu Lys Val Leu Asp 300 305 310 cat gcc atg att ggc ccc gaa ggc aca gac aac tgc cat aag ttt gtt 1011 His Ala Met Ile Gly Pro Glu Gly Thr Asp Asn Cys His Lys Phe Val 315 320 325 gac att ctt ggc tta cga acc atc ttt ccc ctc ttt atg aaa tct ccc 1059 Asp Ile Leu Gly Leu Arg Thr Ile Phe Pro Leu Phe Met Lys Ser Pro 330 335 340 345 agg aag atc aag aaa gtg gga acc act gag aag gaa cat gaa gag cat 1107 Arg Lys Ile Lys Lys Val Gly Thr Thr Glu Lys Glu His Glu Glu His 350 355 360 gtc tgt tcg atc ctg gct tcc ctc ctg cgg aac ctg aga ggg cag cag 1155 Val Cys Ser Ile Leu Ala Ser Leu Leu Arg Asn Leu Arg Gly Gln Gln 365 370 375 cgg acc cgg ctt ctg aat aaa ttc act gaa aat gac agt gag aag gtt 1203 Arg Thr Arg Leu Leu Asn Lys Phe Thr Glu Asn Asp Ser Glu Lys Val 380 385 390 gac aga cta atg gag ttg cat ttt aaa tat ctg ggt gca atg cag gtg 1251 Asp Arg Leu Met Glu Leu His Phe Lys Tyr Leu Gly Ala Met Gln Val 395 400 405 gcg gac aag aag att gaa ggg gaa aaa cac gac atg gtc cgg cga gga 1299 Ala Asp Lys Lys Ile Glu Gly Glu Lys His Asp Met Val Arg Arg Gly 410 415 420 425 gag atc atc gac aat gac acc gag gag gag ttc tac ctc cgg cgc ctg 1347 Glu Ile Ile Asp Asn Asp Thr Glu Glu Glu Phe Tyr Leu Arg Arg Leu 430 435 440 gat gcg ggg ctc ttt gtt ctc cag cac atc tgc tac atc atg gcc gag 1395 Asp Ala Gly Leu Phe Val Leu Gln His Ile Cys Tyr Ile Met Ala Glu 445 450 455 atc tgc aat gcc aat gtc ccc cag att cgc cag agg gtt cac cag atc 1443 Ile Cys Asn Ala Asn Val Pro Gln Ile Arg Gln Arg Val His Gln Ile 460 465 470 cta aac atg cga gga agc tcc atc aaa att gtc agg cat atc atc aag 1491 Leu Asn Met Arg Gly Ser Ser Ile Lys Ile Val Arg His Ile Ile Lys 475 480 485 gag tat gca gag aac atc ggg gac ggc cgg agc ccg gag ttc cgg gag 1539 Glu Tyr Ala Glu Asn Ile Gly Asp Gly Arg Ser Pro Glu Phe Arg Glu 490 495 500 505 aac gag caa aag cgc atc ctg ggc ttg ctg gag aac ttc tag aggcacc 1588 Asn Glu Gln Lys Arg Ile Leu Gly Leu Leu Glu Asn Phe * 510 515 ttggccctgc gcatcatgga ctctctcagc ttccctccca ggatcagttt ctacacaact 1648 ctgtgtggct tttggacaaa ttaaagctag ttttggtaaa aaaaaaaaa 1697 43 1895 DNA Homo sapiens CDS (188)..(1441) 43 tagcccgccc cctttgaaat gagtagcatg cggaaccgcc ccggaattcc cgggtcgacc 60 cacgcgtccg gggctctgtt cctgcccgcg cggtgttccg cattctgcaa gcctccggag 120 cgcacgtcgg cagtcggctc cctcgttgac cgaatcaccg acctctctcc ccagctgtat 180 ttccaaa atg tcg ctt tct aac aag ctg acg ctg gac aag ctg gac gtt 229 Met Ser Leu Ser Asn Lys Leu Thr Leu Asp Lys Leu Asp Val 1 5 10 aaa ggg aag cgg gtc gtt atg aga gtc gac ttc aat gtt cct atg aag 277 Lys Gly Lys Arg Val Val Met Arg Val Asp Phe Asn Val Pro Met Lys 15 20 25 30 aac aac cag ata aca aac aac cag agg att aag gct gct gtc cca agc 325 Asn Asn Gln Ile Thr Asn Asn Gln Arg Ile Lys Ala Ala Val Pro Ser 35 40 45 atc aaa ttc tgc ttg gac aat gga gcc aag tcg gta gtc ctt atg agc 373 Ile Lys Phe Cys Leu Asp Asn Gly Ala Lys Ser Val Val Leu Met Ser 50 55 60 cac cta ggc cgg cct gat ggt gtg ccc atg cct gac aag tac tcc tta 421 His Leu Gly Arg Pro Asp Gly Val Pro Met Pro Asp Lys Tyr Ser Leu 65 70 75 gag cca gtt gct gta gaa ctc aaa tct ctg ctg ggc aag gat gtt ctg 469 Glu Pro Val Ala Val Glu Leu Lys Ser Leu Leu Gly Lys Asp Val Leu 80 85 90 ttc ttg aag gac tgt gta ggc cca gaa gtg gag aaa gcc tgt gcc aac 517 Phe Leu Lys Asp Cys Val Gly Pro Glu Val Glu Lys Ala Cys Ala Asn 95 100 105 110 cca gct gct ggg tct gtc atc ctg ctg gag aac ctc cgc ttt cat gtg 565 Pro Ala Ala Gly Ser Val Ile Leu Leu Glu Asn Leu Arg Phe His Val 115 120 125 gag gaa gaa ggg aag gga aaa gat gct tct ggg aac aag gtt aaa gcc 613 Glu Glu Glu Gly Lys Gly Lys Asp Ala Ser Gly Asn Lys Val Lys Ala 130 135 140 gag cca gcc aaa ata gaa gct ttc cga gct tca ctt tcc aag cta ggg 661 Glu Pro Ala Lys Ile Glu Ala Phe Arg Ala Ser Leu Ser Lys Leu Gly 145 150 155 gat gtc tat gtc aat gat gct ttt ggc act gct cac aga gcc cac agc 709 Asp Val Tyr Val Asn Asp Ala Phe Gly Thr Ala His Arg Ala His Ser 160 165 170 tcc atg gta gga gtc aat ctg cca cag aag gct ggt ggg ttt ttg atg 757 Ser Met Val Gly Val Asn Leu Pro Gln Lys Ala Gly Gly Phe Leu Met 175 180 185 190 aag aag gag ctg aac tac ttt gca aag gcc ttg gag agc cca gag cga 805 Lys Lys Glu Leu Asn Tyr Phe Ala Lys Ala Leu Glu Ser Pro Glu Arg 195 200 205 ccc ttc ctg gcc atc ctg ggc gga gct aaa gtt gca gac aag atc cag 853 Pro Phe Leu Ala Ile Leu Gly Gly Ala Lys Val Ala Asp Lys Ile Gln 210 215 220 ctc atc aat aat atg ctg gac aaa gtc aat gag atg att att ggt ggt 901 Leu Ile Asn Asn Met Leu Asp Lys Val Asn Glu Met Ile Ile Gly Gly 225 230 235 gga atg gct ttt acc ttc ctt aag gtg ctc aac aac atg gag att ggc 949 Gly Met Ala Phe Thr Phe Leu Lys Val Leu Asn Asn Met Glu Ile Gly 240 245 250 act tct ctg ttt gat gaa gag gga gcc aag att gtc aaa gac cta atg 997 Thr Ser Leu Phe Asp Glu Glu Gly Ala Lys Ile Val Lys Asp Leu Met 255 260 265 270 tcc aaa gct gag aag aat ggt gtg aag att acc ttg cct gtt gac ttt 1045 Ser Lys Ala Glu Lys Asn Gly Val Lys Ile Thr Leu Pro Val Asp Phe 275 280 285 gtc act gct gac aag ttt gat gag aat gcc aag act ggc caa gcc act 1093 Val Thr Ala Asp Lys Phe Asp Glu Asn Ala Lys Thr Gly Gln Ala Thr 290 295 300 gtg gct tct ggc ata cct gct ggc tgg atg ggc ttg gac tgt ggt cct 1141 Val Ala Ser Gly Ile Pro Ala Gly Trp Met Gly Leu Asp Cys Gly Pro 305 310 315 gaa agc agc aag aag tat gct gag gct gtc act cgg gct aag cag att 1189 Glu Ser Ser Lys Lys Tyr Ala Glu Ala Val Thr Arg Ala Lys Gln Ile 320 325 330 gtg tgg aat ggt cct gtg ggg gta ttt gaa tgg gaa gct ttt gcc cgg 1237 Val Trp Asn Gly Pro Val Gly Val Phe Glu Trp Glu Ala Phe Ala Arg 335 340 345 350 gga acc aaa gct ctc atg gat gag gtg gtg aaa gcc act tct agg ggc 1285 Gly Thr Lys Ala Leu Met Asp Glu Val Val Lys Ala Thr Ser Arg Gly 355 360 365 tgc atc acc atc ata ggt ggt gga gac act gcc act tgc tgt gcc aaa 1333 Cys Ile Thr Ile Ile Gly Gly Gly Asp Thr Ala Thr Cys Cys Ala Lys 370 375 380 tgg aac acg gag gat aaa gtc agc cat gtg agc act ggg ggt ggt gcc 1381 Trp Asn Thr Glu Asp Lys Val Ser His Val Ser Thr Gly Gly Gly Ala 385 390 395 agt ttg gag ctc ctg gaa ggt aaa gtc ctt cct ggg gtg gat gct ctc 1429 Ser Leu Glu Leu Leu Glu Gly Lys Val Leu Pro Gly Val Asp Ala Leu 400 405 410 agc aat att tag tac tttcctgcct tttagttcct gtgcacagcc cctaagtcaa 1484 Ser Asn Ile * 415 cttagcattt tctgcatctc cacttggcat tagctaaaac cttccatgtc aagattcagc 1544 tagtggccaa gagatgcagt gccaggaacc cttaaacagt tgcacagcat ctcagctcat 1604 cttcactgca ccctggattt gcatacattc ttcaagatcc catttgaatt ttttagtgac 1664 taaaccattg tgcattctag agtgcatata tttatatttt gcctgttaaa aagaaagtga 1724 gcagtgttag cttagttctc ttttgatgta ggttattatg attagctttg tcactgtttc 1784 actactcagc atggaaacaa gatgaaattc catttgtagg tagtgagaca aaattgatga 1844 tccattaagt aaacaataaa agtgtccatt gaaaccgtga aaaaaaaaaa a 1895 44 1561 DNA Homo sapiens CDS (160)..(1254) 44 gggcgggcgg tttgaatgat agcgtcgaag cgcttgagcc catcttaagc tcggtaccga 60 gctcggatcc actagtccag tgtggtggaa ttcgtcagca gaagagataa aagcaaacag 120 gtctgggagg cagttctgtt gccactctct ctcctgtca atg atg gat ctc aga 174 Met Met Asp Leu Arg 1 5 aat acc cca gcc aaa tct ctg gac aag ttc att gaa gac tat ctc ttg 222 Asn Thr Pro Ala Lys Ser Leu Asp Lys Phe Ile Glu Asp Tyr Leu Leu 10 15 20 cca gac acg tgt ttc cgc atg caa atc aac cat gcc att gac atc atc 270 Pro Asp Thr Cys Phe Arg Met Gln Ile Asn His Ala Ile Asp Ile Ile 25 30 35 tgt ggg ttc ctg aag gaa agg tgc ttc cga ggt agc tcc tac cct gtg 318 Cys Gly Phe Leu Lys Glu Arg Cys Phe Arg Gly Ser Ser Tyr Pro Val 40 45 50 tgt gtg tcc aag gtg gta aag ggt ggc tcc tca ggc aag ggc acc acc 366 Cys Val Ser Lys Val Val Lys Gly Gly Ser Ser Gly Lys Gly Thr Thr 55 60 65 ctc aga ggc cga tct gac gct gac ctg gtt gtc ttc ctc agt cct ctc 414 Leu Arg Gly Arg Ser Asp Ala Asp Leu Val Val Phe Leu Ser Pro Leu 70 75 80 85 acc act ttt cag gat cag tta aat cgc cgg gga gag ttc atc cag gaa 462 Thr Thr Phe Gln Asp Gln Leu Asn Arg Arg Gly Glu Phe Ile Gln Glu 90 95 100 att agg aga cag ctg gaa gcc tgt caa aga gag aga gca ttt tcc gtg 510 Ile Arg Arg Gln Leu Glu Ala Cys Gln Arg Glu Arg Ala Phe Ser Val 105 110 115 aag ttt gag gtc cag gct cca cgc tgg ggc aac ccc cgt gcg ctc agc 558 Lys Phe Glu Val Gln Ala Pro Arg Trp Gly Asn Pro Arg Ala Leu Ser 120 125 130 ttc gta ctg agt tcg ctc cag ctc ggg gag ggg gtg gag ttc gat gtg 606 Phe Val Leu Ser Ser Leu Gln Leu Gly Glu Gly Val Glu Phe Asp Val 135 140 145 ctg cct gcc ttt gat gcc ctg ggt cag ttg act ggc agc tat aaa cct 654 Leu Pro Ala Phe Asp Ala Leu Gly Gln Leu Thr Gly Ser Tyr Lys Pro 150 155 160 165 aac ccc caa atc tat gtc aag ctc atc gag gag tgc acc gac ctg cag 702 Asn Pro Gln Ile Tyr Val Lys Leu Ile Glu Glu Cys Thr Asp Leu Gln 170 175 180 aaa gag ggc gag ttc tcc acc tgc ttc aca gaa cta cag aga gac ttc 750 Lys Glu Gly Glu Phe Ser Thr Cys Phe Thr Glu Leu Gln Arg Asp Phe 185 190 195 ctg aag cag cgc ccc acc aag ctc aag agc ctc atc cgc cta gtc aag 798 Leu Lys Gln Arg Pro Thr Lys Leu Lys Ser Leu Ile Arg Leu Val Lys 200 205 210 cac tgg tac caa aat tgt aag aag aag ctt ggg aag ctg cca cct cag 846 His Trp Tyr Gln Asn Cys Lys Lys Lys Leu Gly Lys Leu Pro Pro Gln 215 220 225 tat gcc ctg gag ctc ctg acg gtc tat gct tgg gag cga ggg agc atg 894 Tyr Ala Leu Glu Leu Leu Thr Val Tyr Ala Trp Glu Arg Gly Ser Met 230 235 240 245 aaa aca cat ttc aac aca gcc cag gga ttt cgg acg gtc ttg gaa tta 942 Lys Thr His Phe Asn Thr Ala Gln Gly Phe Arg Thr Val Leu Glu Leu 250 255 260 gtc ata aac tac cag caa ctc tgc atc tac tgg aca aag tat tat gac 990 Val Ile Asn Tyr Gln Gln Leu Cys Ile Tyr Trp Thr Lys Tyr Tyr Asp 265 270 275 ttt aaa aac ccc att att gaa aag tac ctg aga agg cag ctc acg aaa 1038 Phe Lys Asn Pro Ile Ile Glu Lys Tyr Leu Arg Arg Gln Leu Thr Lys 280 285 290 ccc agg cct gtg atc ctg gac ccg gcg gac cct aca gga aac ttg ggt 1086 Pro Arg Pro Val Ile Leu Asp Pro Ala Asp Pro Thr Gly Asn Leu Gly 295 300 305 ggt gga gac cca aag ggt tgg agg cag ctg gca caa gag gct gag gcc 1134 Gly Gly Asp Pro Lys Gly Trp Arg Gln Leu Ala Gln Glu Ala Glu Ala 310 315 320 325 tgg ctg aat tac cca tgc ttt aag aat tgg gat ggg tcc cca gtg agc 1182 Trp Leu Asn Tyr Pro Cys Phe Lys Asn Trp Asp Gly Ser Pro Val Ser 330 335 340 tcc tgg att ctg ctg gtg aga cct cct gct tcc tcc ctg cca ttc atc 1230 Ser Trp Ile Leu Leu Val Arg Pro Pro Ala Ser Ser Leu Pro Phe Ile 345 350 355 cct gcc cct ctc cat gaa gct tga gacatatagc tggagaccat tctttccaaa 1284 Pro Ala Pro Leu His Glu Ala * 360 gaacttacct cttgccaaag gccatttata ttcatatagt gacaggctgt gctccatatt 1344 ttacagtcat tttggtcaca atcgagggtt tctggaattt tcacatccct tgtccagaat 1404 tcattcccct aagagtaata ataaataatc tctaacacca tttattgact gtttgcttcg 1464 gggtcaggtt ttgtcctaga gccctttaat atgcactttt tcattaaata gtcccaacaa 1524 tcccatgacc cagcaagacg agaagaaaaa aaaaaaa 1561 45 1309 DNA Homo sapiens CDS (143)..(688) 45 atcgacttcg aacgcttgcg gtaccggtcc ggaattcccg ggtcgaccca cgcgtccgct 60 tgtgggagca aaaccaacgc ctggctcgga gcagcagcct ctgaggtgtc cctggccagt 120 gtccttccac ctgtccacaa gc atg ggg aac atc ttc gcc aac ctc ttc aag 172 Met Gly Asn Ile Phe Ala Asn Leu Phe Lys 1 5 10 ggc ctt ttt ggc aaa aaa gaa atg cgc atc ctc atg gtg ggc ctg gat 220 Gly Leu Phe Gly Lys Lys Glu Met Arg Ile Leu Met Val Gly Leu Asp 15 20 25 gct gca ggg aag acc acg atc ctc tac aag ctt aag ctg ggt gag atc 268 Ala Ala Gly Lys Thr Thr Ile Leu Tyr Lys Leu Lys Leu Gly Glu Ile 30 35 40 gtg acc acc att ccc acc ata ggc ttc aac gtg gaa acc gtg gag tac 316 Val Thr Thr Ile Pro Thr Ile Gly Phe Asn Val Glu Thr Val Glu Tyr 45 50 55 aag aac atc agc ttc act gtg tgg gac gtg ggt ggc cag gac aag atc 364 Lys Asn Ile Ser Phe Thr Val Trp Asp Val Gly Gly Gln Asp Lys Ile 60 65 70 cgg ccc ctg tgg cgc cac tac ttc cag aac aca caa ggc ctg atc ttc 412 Arg Pro Leu Trp Arg His Tyr Phe Gln Asn Thr Gln Gly Leu Ile Phe 75 80 85 90 gtg gtg gac agc aat gac aga gag cgt gtg aac gag gcc cgt gag gag 460 Val Val Asp Ser Asn Asp Arg Glu Arg Val Asn Glu Ala Arg Glu Glu 95 100 105 ctc atg agg atg ctg gcc gag gac gag ctc cgg gat gct gtc ctc ctg 508 Leu Met Arg Met Leu Ala Glu Asp Glu Leu Arg Asp Ala Val Leu Leu 110 115 120 gtg ttc gcc aac aag cag gac ctc ccc aac gcc atg aat gcg gcc gag 556 Val Phe Ala Asn Lys Gln Asp Leu Pro Asn Ala Met Asn Ala Ala Glu 125 130 135 atc aca gac aag ctg ggg ctg cac tca cta cgc cac agg aac tgg tac 604 Ile Thr Asp Lys Leu Gly Leu His Ser Leu Arg His Arg Asn Trp Tyr 140 145 150 att cag gcc acc tgc gcc acc agc ggc gac ggg ctc tat gaa gga ctg 652 Ile Gln Ala Thr Cys Ala Thr Ser Gly Asp Gly Leu Tyr Glu Gly Leu 155 160 165 170 gac tgg ctg tcc aat cag ctc cgg aac cag aag tga acgc gacccccctc 702 Asp Trp Leu Ser Asn Gln Leu Arg Asn Gln Lys * 175 180 cctctcactc ctcttgccct ctgctttact ctcatgtggc aaacgtgcgg ctcgtggtgt 762 gagtgccaga agctgcctcc gtggtttggt caccgtgtgc atcgcaccgt gctgtaaatg 822 tggcagacgc agcctgcggc caggcttttt atttaatgta aatagttttt gtttccaatg 882 aggcagtttc tggtactcct atgcaatatt actcagcttt ttttattgta aaaagaaaaa 942 tcaactcact gttcagtgct gagaggggat gtaggcccat gggcacctgg cctccaggag 1002 tcgctgtgtt gggagagccg gccacgccct tggctttaga gctgtgttga aatccatttt 1062 ggtggttggt ttttaaccca aactcagtgc attttttaaa atagttaaga atccaagtcg 1122 agaacacttg aacacacaga agggagaccc cgcctagcat agatttgcag ttacgggctg 1182 gatgccagtc gccagcccat ctgtttccct cgggaacatg aggtggtggg ggcgcaacag 1242 actgcgaaca attctgcatg gtcacaggag agaatccccc aactcgcttg gccttgggga 1302 ccctgct 1309 46 4630 DNA Homo sapiens CDS (3211)..(4233) 46 ggtcggcggc attcggcggc gatggagcgg ccctggggag ctgcggacgg cctctcgcgc 60 tggccccatg gcctcggcct cctcctcctc ctgcagctgc tgccgccgtc gaccctcagc 120 caggaccggc tggacgcgcc gccgccgccc gctgcgccgc tgccgcgctg gtctggcccc 180 atcggggtga gctgggggct gcgggcggcc gcagccgggg gcgcgtttcc ccgcggcggc 240 cgttggcgtc gcagcgcgcc gggcgaggac gaggagtgcg gccgggtccg ggacttcgtc 300 gccaagctgg ccaacaacac gcaccagcat gtgtttgatg atctcagagg ctcagtatcc 360 ttgtcctggg ttggagatag cactggggtc attctagtct tgactacctt ccatgtacca 420 ctggtaatta tgacttttgg acagtccaag ctatatcgaa gtgaggatta tgggaagaac 480 tttaaggata ttacagatct catcaataac acctttattc ggactgaatt tggcatggct 540 attggtcctg agaactctgg aaaggtggtg ttaacagcag aggtgtctgg aggaagtcgt 600 ggaggaagaa tcttcagatc atcagatttt gcgaagaatt ttgtgcaaac agatctccct 660 tttcatcctc tcactcagat gatgtatagc cctcagaatt ctgattatct tttagctctc 720 agcactgaaa atggcctgtg ggtgtccaag aattttgggg gaaaatggga agaaatccac 780 aaagcagtat gtttggccaa atggggatca gacaacacca tcttctttac aacctatgca 840 aatggctcct gcaaagctga ccttggggct ctggaattat ggagaacttc agacttggga 900 aaaagcttca aaactattgg tgtgaaaatc tactcatttg gtcttggggg acgtttcctt 960 tttgcctctg tgatggctga taaggatacc acaagaagga tccacgtttc aacagatcaa 1020 ggggacacat ggagcatggc ccagctcccc tccgtgggac aggaacagtt ctattctatt 1080 ctggcagcaa atgatgacat ggtattcatg catgtagatg aacctggaga cactgggttt 1140 ggcacaatct ttacctcaga tgatcgaggc attgtctatt ccaagtcttt ggaccgacat 1200 ctctacacta ccacaggcgg agagacggac tttaccaacg tgacctccct ccgcggcgtc 1260 tacataacaa gcgtgctctc cgaagataat tctatccaga ccatgatcac ttttgaccaa 1320 ggaggaaggt ggacgcacct gaggaagcct gaaaacagtg aatgtgatgc tacagcaaaa 1380 aacaagaatg agtgcagcct tcatattcat gcttccatca gcatctccca gaaactgaat 1440 gttccaatgg ccccactctc agagccgaag ctcgtaggca tggtcattgc tcatggtagc 1500 gtgggggatg ccatctcagt gatggttcca gatgtgtaca tctcagatga tgggggttac 1560 tcctggacaa agatgctgga aggaccccac tattacacca tcctggattc tggaggcatc 1620 attgtggcca ttgagcacag cagccgtcct atcaatgtga ttaagttctc cacagacgaa 1680 ggtcaatgct ggcaaaccta cacgttcacc agggacccca tctatttcac tggcctagct 1740 tcagaacctg gagctaggtc catgaatatc agcatttggg gcttcacaga atctttcctg 1800 accagccagt gggtctccta caccattgat tttaaagata tccttgaaag gaactgtgaa 1860 gagaaggact ataccatatg gctggcacac tccacagacc ctgaagatta tgaagatggc 1920 tgcattttgg gctacaaaga acagtttctg cggctacgca agtcatccat gtgtcagaat 1980 ggtcgagact atgttgtgac caagcagccc tccatctgcc tctgttccct ggaggacttt 2040 ctctgtgatt ttggctacta ccgtccagaa aatgactcca agtgtgtgga acagccagaa 2100 ctgaagggcc acgacctgga gttttgtctg tacggaaaga gaagagaaga acacctaaca 2160 acaaatgggt accggaaaat tccaggggac aaatgccagg gtggggtaaa tccagttcga 2220 gaagtaaaag acttgaaaaa gaaatgcaca agcaactttt tgagtccgga aaaacagaat 2280 tccaagtcaa attctgttcc aattatcctg gccatcgtgg gattgatgct ggtcacagtc 2340 gtagcaggag tgctcattgt gaagaaatat gtctgtgggg gaaggttcct ggtgcatcga 2400 tactctgtgc tgcagcagca tgcagaggcc aatggtgtgg atggtgtgga tgctttggac 2460 acagcctccc acactaataa aagtggttat catgatgact cagatgagga cctcttggaa 2520 tagctcttca gaggagctgg acccagcatg gatggtggaa ccacagtacc tcttacactc 2580 cctgtggctc caacttcagg aaataaattt cccattgcga ggacccagct ctgtttctgc 2640 tgcttccatc aaagccaaaa gacctacact aaagaaatgc agggtggggg tggggaaccc 2700 tgagcacttt tttacaattg gctctgagaa aaagggagac attttaaatt ctttaacttc 2760 ttatttctcg tcctgtctct ttgcaaagta tgggcttttg tttttgtttt ttaagggaaa 2820 cgaaatggaa ttcgaaggga ccttttcact aaccccactt ctgtgtgttc tgcatggtgc 2880 ctgccccagg gcatctgcca actccagtat cagctctcac agtgtacttg gtaccatccc 2940 tgggctctgc tggcgagacg aaacagctgt agagatgaaa acaggctgca gaggctggca 3000 cagctggccg gcttttctcc atctggggac agtcctactc caagaacact gcacaccagc 3060 tcctcacaca gatcccactt acggcgcgca acgggttcta ggctgcaggc agctcgagga 3120 cccgcggccc cgccccggct cggcctggca gatagcagag gcagcaggcc gtgccggggg 3180 ggcatgttgc tgtaaccagt ggcccagggg atg tta cgg tgg aca gtg cac 3231 Met Leu Arg Trp Thr Val His 1 5 ctg gag ggc ggg ccc cgc agg gtg aac cat gct gca gtg gct gtc ggg 3279 Leu Glu Gly Gly Pro Arg Arg Val Asn His Ala Ala Val Ala Val Gly 10 15 20 cat cgg gta tac tcc ttc ggg ggt tac tgc tct ggt gaa gac tat gag 3327 His Arg Val Tyr Ser Phe Gly Gly Tyr Cys Ser Gly Glu Asp Tyr Glu 25 30 35 aca ctg cgt cag ata gat gtg cac att ttc aat gca gtg tcc ttg cgt 3375 Thr Leu Arg Gln Ile Asp Val His Ile Phe Asn Ala Val Ser Leu Arg 40 45 50 55 tgg aca aag ctg ccc ccg gtg aag tct gcc atc cgt ggg caa gct cct 3423 Trp Thr Lys Leu Pro Pro Val Lys Ser Ala Ile Arg Gly Gln Ala Pro 60 65 70 gtg gta ccc tac atg cgc tat gga cac tca acc gtc ctc atc gac gac 3471 Val Val Pro Tyr Met Arg Tyr Gly His Ser Thr Val Leu Ile Asp Asp 75 80 85 aca gtc ctc ctt tgg ggc ggg cgg aat gac acc gaa ggg gcc tgc aat 3519 Thr Val Leu Leu Trp Gly Gly Arg Asn Asp Thr Glu Gly Ala Cys Asn 90 95 100 gtg ctc tat gcc ttt gac gtc aat acg cac aag tgg ttc aca ccc cga 3567 Val Leu Tyr Ala Phe Asp Val Asn Thr His Lys Trp Phe Thr Pro Arg 105 110 115 gtg tca ggg aca gtt cct ggg gcc cgg gat gga cat tca gcc tgt gtc 3615 Val Ser Gly Thr Val Pro Gly Ala Arg Asp Gly His Ser Ala Cys Val 120 125 130 135 cta ggc aag atc atg tac att ttt ggg ggc tac gag cag cag gcg gac 3663 Leu Gly Lys Ile Met Tyr Ile Phe Gly Gly Tyr Glu Gln Gln Ala Asp 140 145 150 tgt ttt tcc aat gac att cac aag cta gat acc agc acc atg aca tgg 3711 Cys Phe Ser Asn Asp Ile His Lys Leu Asp Thr Ser Thr Met Thr Trp 155 160 165 act ctt atc tgt aca aag ggc agc cct gca cgc tgg agg gac ttc cac 3759 Thr Leu Ile Cys Thr Lys Gly Ser Pro Ala Arg Trp Arg Asp Phe His 170 175 180 tca gcc aca atg ctg gga agt cac atg tat gtc ttt ggg ggc cgt gcc 3807 Ser Ala Thr Met Leu Gly Ser His Met Tyr Val Phe Gly Gly Arg Ala 185 190 195 gac cgc ttt ggg cca ttc cat tcc aac aat gag att tac tgc aac cgc 3855 Asp Arg Phe Gly Pro Phe His Ser Asn Asn Glu Ile Tyr Cys Asn Arg 200 205 210 215 att cga gtc ttt gac acc aga act gag gct tgg ctg gac tgt ccc ccg 3903 Ile Arg Val Phe Asp Thr Arg Thr Glu Ala Trp Leu Asp Cys Pro Pro 220 225 230 act cca gtg ctg cct gag ggg cgc cgg agc cac tcg gcc ttt ggc tac 3951 Thr Pro Val Leu Pro Glu Gly Arg Arg Ser His Ser Ala Phe Gly Tyr 235 240 245 aat ggg gag ctg tac atc ttt ggt ggt tat aat gca agg ctg aac cgg 3999 Asn Gly Glu Leu Tyr Ile Phe Gly Gly Tyr Asn Ala Arg Leu Asn Arg 250 255 260 cac ttc cat gac ctc tgg aag ttt aat cct gtg tcc ttt acc tgg aaa 4047 His Phe His Asp Leu Trp Lys Phe Asn Pro Val Ser Phe Thr Trp Lys 265 270 275 aag att gaa ccg aag ggg aag ggg cca tgt ccc cgc cgg cgc cag tgc 4095 Lys Ile Glu Pro Lys Gly Lys Gly Pro Cys Pro Arg Arg Arg Gln Cys 280 285 290 295 tgc tgt att gtt ggt gac aag att gtc ctc ttt ggg ggt acc agt cca 4143 Cys Cys Ile Val Gly Asp Lys Ile Val Leu Phe Gly Gly Thr Ser Pro 300 305 310 tct cct gag gaa ggc ctg gga gat gaa ttt gac ctt ata gat cat tct 4191 Ser Pro Glu Glu Gly Leu Gly Asp Glu Phe Asp Leu Ile Asp His Ser 315 320 325 gac tta cac att ttg gac ttt agt aag tat agt tat tcc tga attgctc 4240 Asp Leu His Ile Leu Asp Phe Ser Lys Tyr Ser Tyr Ser * 330 335 340 ctgccatcaa ggtccctgtc tttgggtggt gatatccagg actggatcct ttcctcacct 4300 ttgaccatct cctttccaca acttctccct taaaggccct agtctgaaga ctctgtgcaa 4360 actggccgtg attcagtata acctagacca gtcctgtttg cctcatgata tcaggtacag 4420 cgagtggttg gaagggaggg attgagtgag ggtgagtgag cagcacaggt gtgacctgat 4480 tagcttggtt tcagtatgga ggagacagac atttttgttc tgctggagat gtagtgtact 4540 tgtgggaaat gagatggaag gaggtgaatt gacagccatc atataaaact gtcagcagaa 4600 taaattatat gaaaaaaaaa aaaaagctta 4630 47 1162 DNA Homo sapiens CDS (267)..(734) misc_feature (1)...(1162) n = a,t,c or g 47 gtatgtgtgt gagtgatttg tcctgggaga attgtgtcac tattatctct ggtcatttta 60 atattgttgt gtctgtgttg tggggaggtg cgattcgaat tcgataccgt tatgacatgc 120 gtannnnnnn nnnnnnnnnn nnnnnnnnnn atttggccct cgaggccaag aattcggcac 180 gagtgttcca ttttccgtat ctgcttcggg cttccacctc atttttttcg ctttgcccat 240 tctgtttcag ccagtcgcca agaatc atg aaa gtc gcc agt ggc agc acc gcc 293 Met Lys Val Ala Ser Gly Ser Thr Ala 1 5 acc gcc gcc gcg ggc ccc agc tgc gcg ctg aag gcc ggc aag aca gcg 341 Thr Ala Ala Ala Gly Pro Ser Cys Ala Leu Lys Ala Gly Lys Thr Ala 10 15 20 25 agc ggt gcg ggc gag gtg gtg cgc tgt ctg tct gag cag agc gtg gcc 389 Ser Gly Ala Gly Glu Val Val Arg Cys Leu Ser Glu Gln Ser Val Ala 30 35 40 atc tcg cgc tgc gcc ggg ggc gcc ggg gcg cgc ctg cct gcc ctg ctg 437 Ile Ser Arg Cys Ala Gly Gly Ala Gly Ala Arg Leu Pro Ala Leu Leu 45 50 55 gac gag cag cag gta aac gtg ctg ctc tac gac atg aac ggc tgt tac 485 Asp Glu Gln Gln Val Asn Val Leu Leu Tyr Asp Met Asn Gly Cys Tyr 60 65 70 tca cgc ctc aag gag ctg gtg ccc acc ctg ccc cag aac cgc aag gtg 533 Ser Arg Leu Lys Glu Leu Val Pro Thr Leu Pro Gln Asn Arg Lys Val 75 80 85 agc aag gtg gag att ctc cag cac gtc atc gac tac atc agg gac ctt 581 Ser Lys Val Glu Ile Leu Gln His Val Ile Asp Tyr Ile Arg Asp Leu 90 95 100 105 cag ttg gag ctg aac tcg gaa tcc gaa gtt gga acc ccc ggg ggc cga 629 Gln Leu Glu Leu Asn Ser Glu Ser Glu Val Gly Thr Pro Gly Gly Arg 110 115 120 ggg ctg ccg gtc cgg gct ccg ctc agc acc ctc aac ggc gag atc agc 677 Gly Leu Pro Val Arg Ala Pro Leu Ser Thr Leu Asn Gly Glu Ile Ser 125 130 135 gcc ctg acg gcc gag gcg gca tgc gtt cct gcg gac gat cgc atc ttg 725 Ala Leu Thr Ala Glu Ala Ala Cys Val Pro Ala Asp Asp Arg Ile Leu 140 145 150 tgt cgc tga agcgcct cccccaggga ccggcggacc ccagccatcc agggggcaag 781 Cys Arg * 155 aggaattacg tgctctgtgg gtctccccca acgcgcctcg ccggatctga gggagaacaa 841 gaccgatcgg cggccactgc gcccttaact gcatccagcc tggggctgag gctgaggcac 901 tggcgaggaa agggcgctcc tctttgcaca cctactagtc accaaagact ttagggggtg 961 ggattccact cgtgtgtttc tattttttga aaagcagaca ttttaaaaaa tggtcacgtt 1021 tggtgctttt cagatttctg aggaaattgc tttgtattgt atattacaat gatccccgac 1081 tgaaaatatt gttttacaat agttctgtgg ggctgttttt ttgttattaa acaaataatt 1141 tagatggtga aaaaaaaaaa a 1162 48 1801 DNA Homo sapiens CDS (143)..(1159) 48 aagctggtac gcctgcaggt accggtccgg aattcccggg tcgacgattt cgtgcaggcg 60 ctgggcagtg tggaggtcgt tggagtcact tccgcgtcac cagctcctgt gcctgccagt 120 cggtgcccct cccgctccag cc atg ctc tcc gcc ctc gcc cgg cct gtc agc 172 Met Leu Ser Ala Leu Ala Arg Pro Val Ser 1 5 10 gct gct ctc cgc cgc agc ttc agc acc tcg gcc cag aac aat gct aaa 220 Ala Ala Leu Arg Arg Ser Phe Ser Thr Ser Ala Gln Asn Asn Ala Lys 15 20 25 gta gct gtg cta ggg gcc tct gga ggc atc ggg cag cca ctt tca ctt 268 Val Ala Val Leu Gly Ala Ser Gly Gly Ile Gly Gln Pro Leu Ser Leu 30 35 40 ctc ctg aag aac agc ccc ttg gtg agc cgc ctg acc ctc tat gat atc 316 Leu Leu Lys Asn Ser Pro Leu Val Ser Arg Leu Thr Leu Tyr Asp Ile 45 50 55 gcg cac aca ccc gga gtg gcc gca gat ctg agc cac atc gag acc aaa 364 Ala His Thr Pro Gly Val Ala Ala Asp Leu Ser His Ile Glu Thr Lys 60 65 70 gcc gct gtg aaa ggc tac ctc gga cct gaa cag ctg cct gac tgc ctg 412 Ala Ala Val Lys Gly Tyr Leu Gly Pro Glu Gln Leu Pro Asp Cys Leu 75 80 85 90 aaa ggt tgt gat gtg gta gtt att ccg gct gga gtc ccc aga aag cca 460 Lys Gly Cys Asp Val Val Val Ile Pro Ala Gly Val Pro Arg Lys Pro 95 100 105 ggc atg acc cgg gac gac ctg ttc aac acc aat gcc acg att gtg gcc 508 Gly Met Thr Arg Asp Asp Leu Phe Asn Thr Asn Ala Thr Ile Val Ala 110 115 120 acc ctg acc gct gcc tgt gcc cag cac tgc ccg gaa gcc atg atc tgc 556 Thr Leu Thr Ala Ala Cys Ala Gln His Cys Pro Glu Ala Met Ile Cys 125 130 135 gtc att gcc aat ccg gtt aat tcc acc atc ccc atc aca gca gaa gtt 604 Val Ile Ala Asn Pro Val Asn Ser Thr Ile Pro Ile Thr Ala Glu Val 140 145 150 ttc aag aag cat gga gtg tac aac ccc aac aaa atc ttc ggc gtg acg 652 Phe Lys Lys His Gly Val Tyr Asn Pro Asn Lys Ile Phe Gly Val Thr 155 160 165 170 acc ctg gac atc gtc aga gcc aac acc ttt gtt gca gag ctg aag ggt 700 Thr Leu Asp Ile Val Arg Ala Asn Thr Phe Val Ala Glu Leu Lys Gly 175 180 185 ttg gat cca gct cga gtc aac gtc cct gtc att ggt ggc cat gct ggg 748 Leu Asp Pro Ala Arg Val Asn Val Pro Val Ile Gly Gly His Ala Gly 190 195 200 aag acc atc atc ccc ctg atc tct cag tgc acc ccc aag gtg gac ttt 796 Lys Thr Ile Ile Pro Leu Ile Ser Gln Cys Thr Pro Lys Val Asp Phe 205 210 215 ccc cag gac cag ctg aca gca ctc act ggg cgg atc cag gag gcc ggc 844 Pro Gln Asp Gln Leu Thr Ala Leu Thr Gly Arg Ile Gln Glu Ala Gly 220 225 230 acg gag gtg gtc aag gct aaa gcc gga gca ggc tct gcc acc ctc tcc 892 Thr Glu Val Val Lys Ala Lys Ala Gly Ala Gly Ser Ala Thr Leu Ser 235 240 245 250 atg gcg tat gcc ggc gcc cgc ttt gtc ttc tcc ctt gtg gat gca atg 940 Met Ala Tyr Ala Gly Ala Arg Phe Val Phe Ser Leu Val Asp Ala Met 255 260 265 aat gga aag gaa ggt gtt gtg gaa tgt tcc ttc gtt aag tca cag gaa 988 Asn Gly Lys Glu Gly Val Val Glu Cys Ser Phe Val Lys Ser Gln Glu 270 275 280 acg gaa tgt acc tac ttc tcc aca ccg ctg ctg ctt ggg aaa aag ggc 1036 Thr Glu Cys Thr Tyr Phe Ser Thr Pro Leu Leu Leu Gly Lys Lys Gly 285 290 295 atc gag aag aac ctg ggc atc ggc aaa gtc tcc tct ttt gag gag aag 1084 Ile Glu Lys Asn Leu Gly Ile Gly Lys Val Ser Ser Phe Glu Glu Lys 300 305 310 atg atc tcg gat gcc atc ccc gag ctg aag gcc tcc atc aag aag ggg 1132 Met Ile Ser Asp Ala Ile Pro Glu Leu Lys Ala Ser Ile Lys Lys Gly 315 320 325 330 gaa gat ttc gtg aag acc ctg aag tga gccgc tgtgacgggt ggccagtttc 1184 Glu Asp Phe Val Lys Thr Leu Lys * 335 cttaatttat gaaggcatca tgtcactgca aagccgttgc agataaactt tgtattttaa 1244 tttgctttgg tgatgattac tgtattgaca tcatcatgcc ttccaaattg tgggtggctc 1304 tgtgggcgca tcaataaaag ccgtccttga ttttattttt caaggtccct tctgtaaatg 1364 ctgtgctttc ttccctgtga gagccaactt tagagtgtct gctacctctt cattaccaat 1424 cagaattaga tgatgtttaa ctgttagact gaagcgtgac gctttcatca gtagcttcaa 1484 gaaagtctaa attgttaatt tatggaattg gacacagtat tcagtttacc cgtacatgct 1544 cctcccgccc ctcctgttgg cacccttgca tcgcccaggc ctgattcctc ctgggggtag 1604 ttcaccccca cggggtcata gttcagcggc gaatgccagg cagctgtttt ctggctgagc 1664 aaacagcacc tttctcattg agcttcctct actgacctct gtcccccttg ggatttcatc 1724 ttctgaccga accctgatgt tcagtggcag agacagccca tagccagaac tgtgggtaga 1784 ccagggttgg ggtgtgc 1801 49 3276 DNA Homo sapiens CDS (1502)..(1990) 49 atgaagccgc cgcgcgcccg cggccccgcc caggaagctc cgccactggc agctgggtac 60 cgattcatca ccgacgtgtt taaagtatcc gtgtgtgtgc ctccaaaaac cgtttgtgga 120 gacagcagag tgcccgaggc aggaaaccgc tccggcctgg aggcgcccct gcccggactc 180 gtggactgca gcgctgtccc gctcggaagc cgatcctggg agtccgagcg gggccgcggg 240 gcctgcgtgg cagccgagtc gagggcttcg ggccgcgggg ctgtggagcg ccccttggcg 300 ctgggccgcc ggcagccgca tggctggctg tctggggcac tggggaaaca ggcccagagg 360 gcttcagact cccctaggct tcacaagaac tcccagctgc agggtgtgga aactatgccc 420 cagacgtccg ttgtcttctc cagcatcctt gggcccagct gtagcggaca ggtgcagcct 480 ggcatggggg agcgtggagg cggggccggt ggcggctccg gggacctcat cttccaagat 540 ggacacctca tctctgggtc cctggaggcc ctgatggagc accttgttcc cacggtggac 600 tattaccccg ataggacgta catcttcacc tttctcctga gctcccgggt ctttatgccc 660 cctcatgacc tgctggcccg cgtggggcag atctgcgtgg agcagaagca gcagctggaa 720 gccgggcctg aaaaggccaa gctgaagtct ttctcagcca agatcgtgca gctcctgaag 780 gagtggaccg aggccttccc ctatgacttc caggatgaga aggccatggc caagctgaaa 840 gccatcacac accgtgtcac ccagtgtgat gaggagaatg gcacagtgaa gaaggccatt 900 gcccagatga cacagagcct gttgctgtcc ttggctgccc ggagccagct ccaggaactg 960 cgagagaagc tccggccacc ggctgtagac aaggggccca tcctcaagac caagccacca 1020 gccgcccaga aggccatcct gggcgtgtgc tgcgaccccc tgttgctgtc ccagcagctg 1080 actcacattg agctggacag ggtcagcagc atttaccctg aggacttgat gcagatcgtc 1140 agccacatgg actccttgga caaccacagg tgccgagggg acctgaccaa gacctacagc 1200 ctggaggcct atgacaactg gttcaactgc ctgagcatgc tggtggccac tgaggtgtgc 1260 cggctcctgc tgtcccccgg cagcgccatg ctgaggtcac ctgccacctt ccaggtggtg 1320 aagaagaaac accggacccg catgttggag ttcttcattg atgtggcccg ggagtgcttc 1380 aacatcggga acttcaactc catgatggcc atcatctgtg agtgactggg cacccacggt 1440 ggggctgctg catggcccac ccccccggcc ctgaccctac ctgctgtcct ctgcagctgg 1500 c atg aac ctc agt cct gtg gca agg ctg aag aaa act tgg tcc aag 1546 Met Asn Leu Ser Pro Val Ala Arg Leu Lys Lys Thr Trp Ser Lys 1 5 10 15 gtc aag aca gcc aag ttt gat gtc ttg gag cat cac atg gac ccg tcc 1594 Val Lys Thr Ala Lys Phe Asp Val Leu Glu His His Met Asp Pro Ser 20 25 30 agc aac ttc tgc aac tac cgt aca gcc ctg cag ggg gcc acg cag agg 1642 Ser Asn Phe Cys Asn Tyr Arg Thr Ala Leu Gln Gly Ala Thr Gln Arg 35 40 45 tcc cag atg gcc aac agc agc cgt gaa aag atc gtc atc cct gtg ttc 1690 Ser Gln Met Ala Asn Ser Ser Arg Glu Lys Ile Val Ile Pro Val Phe 50 55 60 aac ctc ttc gtt aag gac atc tac ttc ctg cac aaa atc cat acc aac 1738 Asn Leu Phe Val Lys Asp Ile Tyr Phe Leu His Lys Ile His Thr Asn 65 70 75 cac ctg ccc aac ggg cac att aac ttt aag aaa ttc tgg gag atc tcc 1786 His Leu Pro Asn Gly His Ile Asn Phe Lys Lys Phe Trp Glu Ile Ser 80 85 90 95 aga cag atc cat gag ttc atg aca tgg aca cag gta gag tgt cct ttc 1834 Arg Gln Ile His Glu Phe Met Thr Trp Thr Gln Val Glu Cys Pro Phe 100 105 110 gag aag gac aag aag att cag agt tac ctg ctc acg gcg ccc atc tac 1882 Glu Lys Asp Lys Lys Ile Gln Ser Tyr Leu Leu Thr Ala Pro Ile Tyr 115 120 125 agc gag gaa gct ctc ttc gtc gcc tcc ttt gaa agt gag ggt ccc gag 1930 Ser Glu Glu Ala Leu Phe Val Ala Ser Phe Glu Ser Glu Gly Pro Glu 130 135 140 aac cac atg gaa aaa gac agc tgg aag acc ctc agg acc acc ctt ctg 1978 Asn His Met Glu Lys Asp Ser Trp Lys Thr Leu Arg Thr Thr Leu Leu 145 150 155 aac aga gcc tga ggc ggatgcagcc cgcgacgcca gaggaagcac gtgcactaac 2033 Asn Arg Ala * 160 tgggtttaaa ttttgactga tgtgggttga gatgaggagg cctcactggt tggggtccat 2093 tttgtatata acttttatga gaaaaaaatg gtaattattt cacgcatcaa cctttggcac 2153 ttacaaagtt ttttttgttt attttaaata acagggcagg gccctgcttt ggggaggggg 2213 aggggagagt atcatgggag atggtatcca tgataacatc ttattctaat gaaatgtaga 2273 tttttatttt ctacttttga ttattaacat cttatgaaaa aaatatttta aaaaacccag 2333 ccaaaaccaa cgtgagccct gcctgctcgg acgcctttcc agccagtgtc tctgacgtcg 2393 gggttagtgc cttagagggt actggggtct ggtcttcctg ctctgtggtt tgggctgcgg 2453 tgagtcccac tccacctggg cgcctgccct caggagcctg ggctgcgagg ctccatagga 2513 gggctggtgg ctgggaggtc gcgtccgcac acttctggaa gtgagccttt gagtacgggc 2573 tgtccaaagt ttacattttc attttccttt cagggatttg tggggtcagg gaggggcagg 2633 gggcacctgg cagcatattt tctgtgacaa tgtgtccagc aaatcattct tcaactacat 2693 tttagaaagg aggaaatcta aaataaggta agggagggaa gcatggagtt gtcagttttc 2753 tgggctgtga ctgaaagaca cactgagctg tgatgaagaa aaatacatgg ccgactccag 2813 ggtggtgaca tttagagcta gtcttgaaac ctatcatcta cagaggggag ggcagccaac 2873 agccctcttc ccacctgggt aggcagcgcc ctaattggaa ttggaaacag aaaattcgcc 2933 aggccatact gctggagccc attcagataa aactgcccaa tactgagagg tgttttctac 2993 acccagctag aggagcacac tccattttcc catgtctgac ttcgtggtgt gagccctggg 3053 ccctactgac catggcgcag gacagctgtc cttcagaaag cacacggtca atccacgtgg 3113 accgtctccc tcgcaggaac tccgcatcct tgtccctctc tgcattccca gtttccgcag 3173 gagccttgat caatggggaa gcctgggtga ggatgggcca ggtcccaatt cccaaagctc 3233 ctggaagagc ctgaagacat tgggaaaggc tgggcctggt gag 3276 50 6170 DNA Homo sapiens CDS (61)..(3075) 50 actttctgat tcatactgac cctggccaat gctgtaactt ctctttctgt ttcctttccc 60 atg cta ttt cat ccc ctc ctt ccg cac ctc att tct cca ccc ctt gac 108 Met Leu Phe His Pro Leu Leu Pro His Leu Ile Ser Pro Pro Leu Asp 1 5 10 15 cct ggg aca cta gcc ggg aac cag agg ctc cag cag gtc atg cac gcg 156 Pro Gly Thr Leu Ala Gly Asn Gln Arg Leu Gln Gln Val Met His Ala 20 25 30 gcg gat cct ctg gag atc cag gct gac gtg cac tgg act cat atc cgt 204 Ala Asp Pro Leu Glu Ile Gln Ala Asp Val His Trp Thr His Ile Arg 35 40 45 gag aga gag gag gaa gag agg atg gcg ccg gcc tct gag tcc tct gct 252 Glu Arg Glu Glu Glu Glu Arg Met Ala Pro Ala Ser Glu Ser Ser Ala 50 55 60 tcc gga gcc cca ttg gat gag aat gac cta gag gaa gat gtg gac tca 300 Ser Gly Ala Pro Leu Asp Glu Asn Asp Leu Glu Glu Asp Val Asp Ser 65 70 75 80 gaa cca gcc gag ata gaa ggg gag gca gca gag gat ggg gac cca ggg 348 Glu Pro Ala Glu Ile Glu Gly Glu Ala Ala Glu Asp Gly Asp Pro Gly 85 90 95 gac act ggt gct gag ctg gat gat gat cag cac tgg tct gac agc ccg 396 Asp Thr Gly Ala Glu Leu Asp Asp Asp Gln His Trp Ser Asp Ser Pro 100 105 110 tcg gat gct gac aga gag ctg cgt ttg ccg tgc cca gct gag ggg gaa 444 Ser Asp Ala Asp Arg Glu Leu Arg Leu Pro Cys Pro Ala Glu Gly Glu 115 120 125 gca gag ctg gag ctg agg gtg tcg gaa gat gag gag aag ctg ccc gcc 492 Ala Glu Leu Glu Leu Arg Val Ser Glu Asp Glu Glu Lys Leu Pro Ala 130 135 140 tca ccg aag cac caa gag aga ggt ccc tcc caa gcc acc agc ccc atc 540 Ser Pro Lys His Gln Glu Arg Gly Pro Ser Gln Ala Thr Ser Pro Ile 145 150 155 160 cgg tct ccc cag gaa tca gct ctt ctg ttc att cca gtc cac agc ccc 588 Arg Ser Pro Gln Glu Ser Ala Leu Leu Phe Ile Pro Val His Ser Pro 165 170 175 tca aca gag ggg ccc caa ctc cca cct gtc cct gcc gcc acc cag gag 636 Ser Thr Glu Gly Pro Gln Leu Pro Pro Val Pro Ala Ala Thr Gln Glu 180 185 190 aaa tca cct gag gag cgc ctt ttc cct gag cct ttg ctc ccc aaa gag 684 Lys Ser Pro Glu Glu Arg Leu Phe Pro Glu Pro Leu Leu Pro Lys Glu 195 200 205 aag ccc aaa gct gat gcc ccc tcg gat ctg aaa gct gtg cac tct ccc 732 Lys Pro Lys Ala Asp Ala Pro Ser Asp Leu Lys Ala Val His Ser Pro 210 215 220 atc cga tca cag cca gtg acc ctg cca gaa gct agg act cct gtc tca 780 Ile Arg Ser Gln Pro Val Thr Leu Pro Glu Ala Arg Thr Pro Val Ser 225 230 235 240 cca ggg agc ccg cag ccc cag cca ccc gtg gcg gcc tcc acg ccc cca 828 Pro Gly Ser Pro Gln Pro Gln Pro Pro Val Ala Ala Ser Thr Pro Pro 245 250 255 ccc agc cca ctc ccc atc tgc tcc cag ccc cag cct tcc acc gag gcc 876 Pro Ser Pro Leu Pro Ile Cys Ser Gln Pro Gln Pro Ser Thr Glu Ala 260 265 270 act gtc cca tcc cct acc cag tcc ccc ata cgc ttc cag cct gcc ccg 924 Thr Val Pro Ser Pro Thr Gln Ser Pro Ile Arg Phe Gln Pro Ala Pro 275 280 285 gcc aaa aca tcc acc cca ctg gcc cct ctc cct gtc caa agc caa agt 972 Ala Lys Thr Ser Thr Pro Leu Ala Pro Leu Pro Val Gln Ser Gln Ser 290 295 300 gac acc aag gac aga ctg ggc agc ccc ctt gct gtg gat gag gcc ctc 1020 Asp Thr Lys Asp Arg Leu Gly Ser Pro Leu Ala Val Asp Glu Ala Leu 305 310 315 320 aga cgg agc gac ctg gtg gag gag ttc tgg atg aag agt gcg gag atc 1068 Arg Arg Ser Asp Leu Val Glu Glu Phe Trp Met Lys Ser Ala Glu Ile 325 330 335 cgc cgc agc ctc ggg ctc aca cct gtg gac cgc agc aag ggg ccc gag 1116 Arg Arg Ser Leu Gly Leu Thr Pro Val Asp Arg Ser Lys Gly Pro Glu 340 345 350 ccc agc ttc ccc acg cct gcc ttc agg cca gtg tcc ctc aaa tcc tat 1164 Pro Ser Phe Pro Thr Pro Ala Phe Arg Pro Val Ser Leu Lys Ser Tyr 355 360 365 tcc gtt gaa aag tcc ccc cag gat gag gga ctc cac ctt ctc aag cct 1212 Ser Val Glu Lys Ser Pro Gln Asp Glu Gly Leu His Leu Leu Lys Pro 370 375 380 ctg tcc atc ccc aaa agg ctg ggc ctg cca aag ccg gaa ggc gag ccg 1260 Leu Ser Ile Pro Lys Arg Leu Gly Leu Pro Lys Pro Glu Gly Glu Pro 385 390 395 400 ttg tcc ctg cca acc ccc cgg tcc ccg tcc gac aga gag cta cgc agc 1308 Leu Ser Leu Pro Thr Pro Arg Ser Pro Ser Asp Arg Glu Leu Arg Ser 405 410 415 gcc cag gag gag cgc agg gag ctg tcc agc agc tct ggc ctg ggc ctg 1356 Ala Gln Glu Glu Arg Arg Glu Leu Ser Ser Ser Ser Gly Leu Gly Leu 420 425 430 cac ggg agc tcc tcc aac atg aag aca ctg ggc agc cag agc ttc aac 1404 His Gly Ser Ser Ser Asn Met Lys Thr Leu Gly Ser Gln Ser Phe Asn 435 440 445 acc tcg gac tcc gcc atg ctc acg ccc ccc tcc agc ccg ccc cca ccg 1452 Thr Ser Asp Ser Ala Met Leu Thr Pro Pro Ser Ser Pro Pro Pro Pro 450 455 460 cca ccc ccg ggc gag gag ccc gcc acc ttg cgg agg aag ctc agg gag 1500 Pro Pro Pro Gly Glu Glu Pro Ala Thr Leu Arg Arg Lys Leu Arg Glu 465 470 475 480 gcc gag ccc aat gcc tcg gtg gtc ccg ccg ccc ttg ccc gcc acc tgg 1548 Ala Glu Pro Asn Ala Ser Val Val Pro Pro Pro Leu Pro Ala Thr Trp 485 490 495 atg cgg ccc ccc cgg gag cct gct cag ccc ccc aga gag gag gtg cgg 1596 Met Arg Pro Pro Arg Glu Pro Ala Gln Pro Pro Arg Glu Glu Val Arg 500 505 510 aag tcg ttt gtg gag agt gtg gag gag att ccc ttt gct gat gat gtg 1644 Lys Ser Phe Val Glu Ser Val Glu Glu Ile Pro Phe Ala Asp Asp Val 515 520 525 gag gac acc tat gac gac aag act gag gac tca agc ctg cag gag aaa 1692 Glu Asp Thr Tyr Asp Asp Lys Thr Glu Asp Ser Ser Leu Gln Glu Lys 530 535 540 ttc ttc acg ccc ccg tcc tgc tgg ccg cgc ccc gag aag cct cgc cac 1740 Phe Phe Thr Pro Pro Ser Cys Trp Pro Arg Pro Glu Lys Pro Arg His 545 550 555 560 ccg ccc ctg gcc aag gag aac ggg agg ctg cct gct ctg gag ggg acg 1788 Pro Pro Leu Ala Lys Glu Asn Gly Arg Leu Pro Ala Leu Glu Gly Thr 565 570 575 ctg cag cca cag aag agg ggg ctg ccc ttg gtg tcc gcg gaa gcc aag 1836 Leu Gln Pro Gln Lys Arg Gly Leu Pro Leu Val Ser Ala Glu Ala Lys 580 585 590 gag ttg gcc gag gag cgc atg cga gcc agg gag aag tcc gtg aag agc 1884 Glu Leu Ala Glu Glu Arg Met Arg Ala Arg Glu Lys Ser Val Lys Ser 595 600 605 cag gcg ctg cgg gat gcc atg gcc agg cag ctg agc agg atg cag cag 1932 Gln Ala Leu Arg Asp Ala Met Ala Arg Gln Leu Ser Arg Met Gln Gln 610 615 620 atg gag ctg gcc tca ggc gcc ccc agg ccc cgc aag gcg tcc tca gca 1980 Met Glu Leu Ala Ser Gly Ala Pro Arg Pro Arg Lys Ala Ser Ser Ala 625 630 635 640 ccc tcc cag ggc aag gag cgc cgg cct gac tcc ccc aca cgc ccc act 2028 Pro Ser Gln Gly Lys Glu Arg Arg Pro Asp Ser Pro Thr Arg Pro Thr 645 650 655 ctc agg ggc tcc gag gag ccc acc ctg aag cat gaa gcc acc agc gag 2076 Leu Arg Gly Ser Glu Glu Pro Thr Leu Lys His Glu Ala Thr Ser Glu 660 665 670 gag gtc ctc tcc ccg ccg tcg gac tca ggg ggc cca gat ggc tct ttc 2124 Glu Val Leu Ser Pro Pro Ser Asp Ser Gly Gly Pro Asp Gly Ser Phe 675 680 685 act tca tcc gag ggc tcc agt ggg aag agc aag aag agg tcg tca ctc 2172 Thr Ser Ser Glu Gly Ser Ser Gly Lys Ser Lys Lys Arg Ser Ser Leu 690 695 700 ttc tcc ccc cgc aga aac aag aag gag aag aag tcc aaa ggc gag ggc 2220 Phe Ser Pro Arg Arg Asn Lys Lys Glu Lys Lys Ser Lys Gly Glu Gly 705 710 715 720 cgg ccc ccg gag aag ccc agc tcc aac ctc ctg gaa gaa gcc gcc gcc 2268 Arg Pro Pro Glu Lys Pro Ser Ser Asn Leu Leu Glu Glu Ala Ala Ala 725 730 735 aaa ccc aag tcc ctg tgg aag tcc gtc ttc tcc ggg tac aag aag gac 2316 Lys Pro Lys Ser Leu Trp Lys Ser Val Phe Ser Gly Tyr Lys Lys Asp 740 745 750 aag aag aag aag gcc gac gac aag tcc tgc ccc agc acc ccc tcc agc 2364 Lys Lys Lys Lys Ala Asp Asp Lys Ser Cys Pro Ser Thr Pro Ser Ser 755 760 765 ggg gcc acg gtg gac tct gga aag cac agg gtg ctt ccc gtc gta agg 2412 Gly Ala Thr Val Asp Ser Gly Lys His Arg Val Leu Pro Val Val Arg 770 775 780 gca gag ctg cag ctc cgg cgc cag ctg agc ttc tcc gag gac tca gac 2460 Ala Glu Leu Gln Leu Arg Arg Gln Leu Ser Phe Ser Glu Asp Ser Asp 785 790 795 800 ctc tcc agc gac gat gtc ctt gag aag tcc tca cag aag tcc cgg cga 2508 Leu Ser Ser Asp Asp Val Leu Glu Lys Ser Ser Gln Lys Ser Arg Arg 805 810 815 gag cca aga acc tac acg gag gag gaa ctg aat gcc aag ctg acc cgg 2556 Glu Pro Arg Thr Tyr Thr Glu Glu Glu Leu Asn Ala Lys Leu Thr Arg 820 825 830 cgt gtg caa aag gca gct cgg aga cag gcc aag cag gag gag ctt aag 2604 Arg Val Gln Lys Ala Ala Arg Arg Gln Ala Lys Gln Glu Glu Leu Lys 835 840 845 cgg ctg cat cga gcc cag atc atc cag cgg cag ctg cag cag gtg gag 2652 Arg Leu His Arg Ala Gln Ile Ile Gln Arg Gln Leu Gln Gln Val Glu 850 855 860 gag agg cag cgg cgg ctg gag gaa agg ggc gtg gct gtg gag aag gcg 2700 Glu Arg Gln Arg Arg Leu Glu Glu Arg Gly Val Ala Val Glu Lys Ala 865 870 875 880 ctc cgg ggc gaa gca ggc atg ggc aag aag gac gac ccc aag ctg atg 2748 Leu Arg Gly Glu Ala Gly Met Gly Lys Lys Asp Asp Pro Lys Leu Met 885 890 895 cag gag tgg ttc aag cta gtg cag gag aag aac gcc atg gtg cgc tac 2796 Gln Glu Trp Phe Lys Leu Val Gln Glu Lys Asn Ala Met Val Arg Tyr 900 905 910 gag tcg gag ctg atg atc ttt gcc cgg gag ctg gag ctg gaa gac cgg 2844 Glu Ser Glu Leu Met Ile Phe Ala Arg Glu Leu Glu Leu Glu Asp Arg 915 920 925 cag agt cga ctg cag cag gag ctc cgg gaa cgc atg gca gtg gaa gat 2892 Gln Ser Arg Leu Gln Gln Glu Leu Arg Glu Arg Met Ala Val Glu Asp 930 935 940 cac ctt aag act gag gag gag ctg tca gaa gag aag cag att ctc aat 2940 His Leu Lys Thr Glu Glu Glu Leu Ser Glu Glu Lys Gln Ile Leu Asn 945 950 955 960 gag atg ctg gag gtg gtg gag cag aga gac tca ctg gtg gcg ctg ctg 2988 Glu Met Leu Glu Val Val Glu Gln Arg Asp Ser Leu Val Ala Leu Leu 965 970 975 gag gag cag cgg ctc cgg gag aga gag gag gac aag gac ctg gag gct 3036 Glu Glu Gln Arg Leu Arg Glu Arg Glu Glu Asp Lys Asp Leu Glu Ala 980 985 990 gcc atg ctg tcc aag ggc ttc agc ctt aac tgg tcc tga gctcccaccc 3085 Ala Met Leu Ser Lys Gly Phe Ser Leu Asn Trp Ser * 995 1000 aacgctccat tttctgttgg catccgcctg gccaggcagt ggcatccaaa ccacccggag 3145 ccgcgatctg aggaggcctg gcacctcctt ggagtttacg ctcagatgcc cgtgtgctgc 3205 ttggaaagtg gtcgagtccc gcgtgcagtg gggagcccca ggtgacagtg gttatctgag 3265 acggctccac ctcctgggag gaggcccacc tggacctccc actcagagga ggagcacggc 3325 gtgtatggca tgacgcaggg gaccaccccg cgcgctccct gaggatgtgc tggctgtgcc 3385 ccttttttcc actggcacat ttggtaagag agggaagctg ctccccgtca gaaccacagt 3445 gcgccgtgcg aggggcactg tcttcttcat gctccctgga gcaccaccaa agaaacgtaa 3505 acaatacccc acgaaagcag ggtcaggggt cagggtgcga tcgagaccca ggatgggggc 3565 gtccagtcat gcccacccca gcatcacagg agacatggag gtgcgggcag gctcctgaat 3625 tattatgcaa attaggagga cgcaggaggg gtctgccctc cagccgaaca ccacacactg 3685 gaccctaagt ggccaaatgc ctgggccgct tgctggctgt ggcctgaggc ttgtgggttg 3745 ctgcattttg cttgtagttc acaaccattt tgacactgga aaatgctgac tttgggggac 3805 aggatgaggc cctacattct aagcccccag ttggcagaca ggcattgtcc ctgttccaca 3865 tttatgtcgg gacaggagat gaccttttcc tccgtgtttt tcctgtgttt gcacgttgaa 3925 atgaagctga caacctggca agacgctcag ccgcttcaaa ccctttttgt caattaactt 3985 attttttaat acttgaaaag aagtaacttc gtttgtgtat ctttactaga ggaactgatc 4045 acctgcgccc gggtgcggga gccacagcgg catctggtgc gtcctacgcg acctggtccg 4105 gggctgcccg gtgctcctca cgtgcatcta tttattagcc tttctcttcg tatcactggc 4165 ctggctggca tcagggagct gcccagaacc cctgtgtggg tcctcctcac agctttctgt 4225 cccctcctcc acccggtgcc tgcctcaccc tggcgctaga ccatctggac cactcatgtg 4285 atgagggtgc atttccgttc tgttttgggc caggccaaca gcagagctgc cactctcacc 4345 ctcccagtga gaattccggc tctgcagaac tcgcccttgt ctcagtttgg gggccagggc 4405 atcaccttcc tccgcacata tgttaaagaa ggtttcagga tgggccctca tccacacagg 4465 ccaagagggt gcaaggtggg accctggaat catgtggctg gtgagaattc cctcctccca 4525 gcctaagatt cacccagacg gaaacgcgag tgctgcagtg gatgctggga tgcaggctgg 4585 tgcctctcaa gcagatcagc gactcccgcc cttccttccg aaggcgacgg gcacctgcct 4645 tgtgccggat cttcatgggg acataaaggg cgagccccga atcactagct cctatagcca 4705 aactgttcct ttctcacggt tccgcaccag cctggctgtg tacagctcat caagccactg 4765 agctaatcgg ggtggggtgc ttgtccatca aagcagtcag cacatagccc agcgaggagc 4825 catccggacc agacccgcct gccaggggcg ctccagccgc ctgccgcctc cgtgggctgg 4885 ggccagctgg gagcagaggc ctggcccctc caaggcgtcc cgcagtggag caggctgagt 4945 ggctgtgctg accttgggct ttccatggga accacactgt gcttcaactt gaacattcat 5005 cccagctgca aaggagcaaa gaacctgtgt catccttgtc tgtccagaag ctgccatctc 5065 tctcccatgc acatccggga gataacgccg ctagtggccg ccacagcgct gattctccac 5125 ctgttctcac gtgaggcaaa ggtctccttt tcttttctct ttacttaaaa aaggggagag 5185 gagggtttct agattccatc ttcaaacccc agtcgtccca taaaattgga cgtgggaaaa 5245 gaccttcact gcctgcgtgg cctttcccag acctctcctc gaatgccaca aaaccggtcc 5305 agccccggca gagccgcctt cggcccttgt agctcctgct ggccccacaa cagggaaaca 5365 gtttgcaaag tggcttggaa ggagtgtggc ctagggatct gctgtaagtg gccagacgta 5425 gcaggagagc ccagtgtcac cttctggtcc tggtcagcct taacactgtg gcatcctcca 5485 cagcacacac ggcagcctgc agtctggaag gtggacccga gcctttgcag agaggcgccg 5545 cagagccgca ggcctgcgcc ccagccttct gctcccgact gtgaggacac ccagttctag 5605 tagagcactt tttttaaagc tcccattttg taaccactag tttgcggttg acttgagtac 5665 tctggtgact tcctgcgtca agcgttctca agctgtgaga atgtgcgcag ctccaggcag 5725 gttttctctc ggagagttaa gtcttccctt gaaggcaggg aagcaggatg gatacacata 5785 tatcacacac ataaaacacc aggtgcggga gcagcccaga ctcaaggctg actaaactgg 5845 aggctgaata ccgtggaggt ccacatgcag cttccctgga gggcaggccg gaggcgctcc 5905 cgcccctggg cttgaggatg ctgcaccccg tgggcttcca ggcctgccca gatgatgcct 5965 tcaggcctct gtccctggcg gccatcctca ggccgatttt gaccagcaat gatagactct 6025 tcttaaccct ttcaaaataa atttttcagt gggacagaaa ggagagttaa aaaacatttt 6085 tttaaaggtg gtaacatctg acccacaaag ggaatgggtc tgttttatgc aaaataaaag 6145 tttttcaaat aaaaaaaaaa aaaaa 6170 51 1303 DNA Homo sapiens CDS (801)..(1268) 51 cactgcacgt accggtccgg aattcccggg tcgacgattt cgtccttccc ccaggccttc 60 accatggccg agtcccccgg ctgctgctcc gtctgggccc gctgcctcca ctgcctgtat 120 agctgccact agaggaaatg ccccagagag aggatgcaaa ccagcaagtg cgactgtatc 180 tggtttggcc tgctcttcct caccttcctc ctttccctga gctggctgta catcgggctc 240 gtccttctca atgacctgca caacttcaat gaattcctct tccgccgctg gggacactgg 300 atggactggt ccctggcatt cctgctggtc atctctctac tgggcacata tgcatccttg 360 ctattggtcc tggccctgct cctgcggctt tgtagacagc ccctgcatct gcacagcctc 420 cacaaggtgc tgctgctcct cattatgctg cttgtggcgg ctggccttgt gggactggac 480 atccaatggc agcaggagtg gcatagcttg cgtgtgtcac tgcagactgc aggtagctct 540 gaactccagc agtcaggccc taagaggaaa gcggggaggg gcactggaga agagcccacc 600 tcaccagctc ttgtccacag gccacagccc cattccttca tattggagca gccgctggaa 660 ttgccctcct ggcctggcct gtggctgata ccttctaccg tatccaccga agaggtccca 720 agattctgct actgctccta ttttttggag ttgtcctggt catctacttg gcccccctat 780 gcatctcctc accctgcatc atg gaa ccc aga gac tta cca ccc aag cct 830 Met Glu Pro Arg Asp Leu Pro Pro Lys Pro 1 5 10 ggg ctg gtg gga cac cga ggg gcc ccc atg ctg gct ccc gag aac acc 878 Gly Leu Val Gly His Arg Gly Ala Pro Met Leu Ala Pro Glu Asn Thr 15 20 25 ctg atg tcc ttg cgg aag aca gct gaa tgc gga gct act gtg ttt gag 926 Leu Met Ser Leu Arg Lys Thr Ala Glu Cys Gly Ala Thr Val Phe Glu 30 35 40 act gat gtg atg gtc agc tcc gat ggg gtc ccc ttc ctc atg cat gat 974 Thr Asp Val Met Val Ser Ser Asp Gly Val Pro Phe Leu Met His Asp 45 50 55 gag cac ctc agc agg acc acg aat gta gcc tct gta ttc cca acc cga 1022 Glu His Leu Ser Arg Thr Thr Asn Val Ala Ser Val Phe Pro Thr Arg 60 65 70 atc aca gcc cac agc agt gac ttc tcc tgg act gaa ctg aag aga ctc 1070 Ile Thr Ala His Ser Ser Asp Phe Ser Trp Thr Glu Leu Lys Arg Leu 75 80 85 90 aat gct gga tcc tgg ttc cta gag agg cga ccc ttc tgg ggg gcc aaa 1118 Asn Ala Gly Ser Trp Phe Leu Glu Arg Arg Pro Phe Trp Gly Ala Lys 95 100 105 ccg ctg gca ggc cct gat cag aaa gag gct gag agt cag acg gta cca 1166 Pro Leu Ala Gly Pro Asp Gln Lys Glu Ala Glu Ser Gln Thr Val Pro 110 115 120 gca tta gaa gag cta ttg gag gaa gct gca gcc ctc aac ctt tcc atc 1214 Ala Leu Glu Glu Leu Leu Glu Glu Ala Ala Ala Leu Asn Leu Ser Ile 125 130 135 atg ttc gac ttg cgc cga ccc cca cag aac cac aat gta ttg cgg ccg 1262 Met Phe Asp Leu Arg Arg Pro Pro Gln Asn His Asn Val Leu Arg Pro 140 145 150 ctc tag aggatcctag cttacgtacg cgtgcatgcg acgac 1303 Leu * 155 52 481 DNA Homo sapiens CDS (65)..(409) 52 cgtcaggtac cggtccggaa ttcccgggtc gacccacgcg tccgaaggag tcctgcttat 60 caca atg aat gtt ctc ctg ggc agc gtt gtg atc ttt gcc acc ttc gtg 109 Met Asn Val Leu Leu Gly Ser Val Val Ile Phe Ala Thr Phe Val 1 5 10 15 act tta tgc aat gca tca tgc tat ttc ata cct aat gag gga gtt cca 157 Thr Leu Cys Asn Ala Ser Cys Tyr Phe Ile Pro Asn Glu Gly Val Pro 20 25 30 gga gat tca acc agg aaa tgc atg gat ctc aaa gga aac aaa cac cca 205 Gly Asp Ser Thr Arg Lys Cys Met Asp Leu Lys Gly Asn Lys His Pro 35 40 45 ata aac tcg gag tgg cag act gac aac tgt gag aca tgc act tgc tac 253 Ile Asn Ser Glu Trp Gln Thr Asp Asn Cys Glu Thr Cys Thr Cys Tyr 50 55 60 gaa aca gaa att tca tgt tgc acc ctt gtt tct aca cct gtg ggt tat 301 Glu Thr Glu Ile Ser Cys Cys Thr Leu Val Ser Thr Pro Val Gly Tyr 65 70 75 gac aaa gac aac tgc caa aga atc ttc aag aag gag gac tgc aag tat 349 Asp Lys Asp Asn Cys Gln Arg Ile Phe Lys Lys Glu Asp Cys Lys Tyr 80 85 90 95 atc gtg gtg gag aag aag gac cca aaa aag acc tgt tct gtc agt gaa 397 Ile Val Val Glu Lys Lys Asp Pro Lys Lys Thr Cys Ser Val Ser Glu 100 105 110 tgg ata atc taa tgt gcttctagta ggcacagggc tcccaggcca ggcctcattc 452 Trp Ile Ile * tcctctggcc tctaatagtc aatgaatgg 481 53 903 DNA Homo sapiens CDS (116)..(547) 53 cgatagttct agagcgcgag cgaccgcacc cgtggctttc tctgggcgac cgggtcccag 60 actcccccca gcacagcaga gcgcttccct gcccacccgc ggaaaccgcc ccagg atg 118 Met 1 cag cac cga ggc ttc ctc ctc ctc acc ctc ctc gcc ctg ctg gcg ctc 166 Gln His Arg Gly Phe Leu Leu Leu Thr Leu Leu Ala Leu Leu Ala Leu 5 10 15 acc tcc gcg gtc gcc aaa aag aaa gat aag gtg aag aag ggc ggc ccg 214 Thr Ser Ala Val Ala Lys Lys Lys Asp Lys Val Lys Lys Gly Gly Pro 20 25 30 ggg agc gag tgc gct gag tgg gcc tgg ggg ccc tgc acc ccc agc agc 262 Gly Ser Glu Cys Ala Glu Trp Ala Trp Gly Pro Cys Thr Pro Ser Ser 35 40 45 aag gat tgc ggc gtg ggt ttc cgc gag ggc acc tgc ggg gcc cag acc 310 Lys Asp Cys Gly Val Gly Phe Arg Glu Gly Thr Cys Gly Ala Gln Thr 50 55 60 65 cag cgc atc cgg tgc agg gtg ccc tgc aac tgg aag aag gag ttt gga 358 Gln Arg Ile Arg Cys Arg Val Pro Cys Asn Trp Lys Lys Glu Phe Gly 70 75 80 gcc gac tgc aag tac aag ttt gag aac tgg ggt gcg tgt gat ggg ggc 406 Ala Asp Cys Lys Tyr Lys Phe Glu Asn Trp Gly Ala Cys Asp Gly Gly 85 90 95 aca ggc acc aaa gtc cgc caa ggc acc ctg aag aag gcg cgc tac aat 454 Thr Gly Thr Lys Val Arg Gln Gly Thr Leu Lys Lys Ala Arg Tyr Asn 100 105 110 gct cag tgc cag gag acc atc cgc gtc acc aag ccc tgc acc ccc aag 502 Ala Gln Cys Gln Glu Thr Ile Arg Val Thr Lys Pro Cys Thr Pro Lys 115 120 125 acc aaa gca aag gcc aaa gcc aag aaa ggg aag gga aag gac tag acg 550 Thr Lys Ala Lys Ala Lys Ala Lys Lys Gly Lys Gly Lys Asp * 130 135 140 ccaagcctgg atgccaagga gcccctggtg tcacatgggg cctggcccac gccctccctc 610 tcccaggccc gagatgtgac ccaccagtgc cttctgtctg ctcgttagct ttaatcaatc 670 atgccctgcc ttgtccctct cactccccag ccccacccct aagtgcccaa agtggggagg 730 gacaagggat tctgggaagc ttgagcctcc cccaaagcaa tgtgagtccc agagcccgct 790 tttgttcttc cccacaattc cattactaag aaacacatca aataaactga ctttttcccc 850 ccaataaaag ctcttctttt ttaatatata aaagcccctt aaaaaaaaaa aaa 903 54 1148 DNA Homo sapiens CDS (33)..(1004) 54 cctagagagc ctgtgatata accatctatt cg atg atg aag ata ccc cac caa 53 Met Met Lys Ile Pro His Gln 1 5 acc caa aaa aag aga tct ctc gag gat ccg aat tcg cgg ccg cgt cga 101 Thr Gln Lys Lys Arg Ser Leu Glu Asp Pro Asn Ser Arg Pro Arg Arg 10 15 20 ctg cgg ctc gtg ggg cag ctc ggc agc atg gcg tcc gtg acg ctg agc 149 Leu Arg Leu Val Gly Gln Leu Gly Ser Met Ala Ser Val Thr Leu Ser 25 30 35 gag gcg gag aag gtg tac atc gtg cat ggc gtc cag gaa gac ctc cgt 197 Glu Ala Glu Lys Val Tyr Ile Val His Gly Val Gln Glu Asp Leu Arg 40 45 50 55 gtg gat ggc cgt ggc tgt gag gac tac cga tgt gtc gaa gtg gaa act 245 Val Asp Gly Arg Gly Cys Glu Asp Tyr Arg Cys Val Glu Val Glu Thr 60 65 70 gat gtg gtg tcc aac act agt ggg tcc gcc agg gtc aag ctg ggt cac 293 Asp Val Val Ser Asn Thr Ser Gly Ser Ala Arg Val Lys Leu Gly His 75 80 85 aca gac atc ttg gtg gga gtg aaa gca gaa atg ggg acg ccg aag ctg 341 Thr Asp Ile Leu Val Gly Val Lys Ala Glu Met Gly Thr Pro Lys Leu 90 95 100 gag aaa cca aat gaa ggc tac ttg gag ttc ttt gtt gac tgt tca gcc 389 Glu Lys Pro Asn Glu Gly Tyr Leu Glu Phe Phe Val Asp Cys Ser Ala 105 110 115 agt gct acc cct gaa ttt gaa ggt aga gga ggt gat gac ctt ggc acc 437 Ser Ala Thr Pro Glu Phe Glu Gly Arg Gly Gly Asp Asp Leu Gly Thr 120 125 130 135 gag atc gct aac acc ctc tat cgg ata ttt aac aat aaa agc agt gtc 485 Glu Ile Ala Asn Thr Leu Tyr Arg Ile Phe Asn Asn Lys Ser Ser Val 140 145 150 gac tta aag acc ctc tgc att agt cct cgg gag cac tgc tgg gtt ctc 533 Asp Leu Lys Thr Leu Cys Ile Ser Pro Arg Glu His Cys Trp Val Leu 155 160 165 tat gtg gat gtg ctg ctt ctg gaa tgt ggt gga aat ttg ttt gat gcc 581 Tyr Val Asp Val Leu Leu Leu Glu Cys Gly Gly Asn Leu Phe Asp Ala 170 175 180 att tcc att gct gta aag gct gct ctc ttc aat aca agg ata cca agg 629 Ile Ser Ile Ala Val Lys Ala Ala Leu Phe Asn Thr Arg Ile Pro Arg 185 190 195 gtt cga gtt ttg gag gat gaa gag ggg tcg aag gac att gaa ttg tca 677 Val Arg Val Leu Glu Asp Glu Glu Gly Ser Lys Asp Ile Glu Leu Ser 200 205 210 215 gat gac cct tat gac tgc ata cga cta agt gtg gag aat gtc ccc tgc 725 Asp Asp Pro Tyr Asp Cys Ile Arg Leu Ser Val Glu Asn Val Pro Cys 220 225 230 att gtc act ctg tgc aag att ggc tat cgg cat gtg gtg gat gct act 773 Ile Val Thr Leu Cys Lys Ile Gly Tyr Arg His Val Val Asp Ala Thr 235 240 245 ctt cag gag gag gcc tgc tcg ctg gcc agc ttg ctg gtg tcg gtg acc 821 Leu Gln Glu Glu Ala Cys Ser Leu Ala Ser Leu Leu Val Ser Val Thr 250 255 260 agc aag gga gtt gtg acg tgc atg agg aaa gtg ggg aag ggc agc ctg 869 Ser Lys Gly Val Val Thr Cys Met Arg Lys Val Gly Lys Gly Ser Leu 265 270 275 gac cca gag agc atc ttc gag atg atg gag act ggc aag cgt gtg ggc 917 Asp Pro Glu Ser Ile Phe Glu Met Met Glu Thr Gly Lys Arg Val Gly 280 285 290 295 aag gta ctg cat gcc tcc ttg cag agt gtt gtg cac aag gaa gaa agc 965 Lys Val Leu His Ala Ser Leu Gln Ser Val Val His Lys Glu Glu Ser 300 305 310 ctg ggg ccc aag aga cag aaa gtt gga ttc ctg gga tga tttgcacatc 1014 Leu Gly Pro Lys Arg Gln Lys Val Gly Phe Leu Gly * 315 320 aactgctcaa ctgtggattg ttttttactt ttccttttaa accggttcgt atatattttt 1074 cttcgctgtt acgaatttac agcagcattt gtacatgtaa aattaaaggc tattttctgg 1134 tcaaaaaaaa aaaa 1148 55 1223 DNA Homo sapiens CDS (742)..(1194) misc_feature (1)...(1223) n = a,t,c or g 55 gactctggcc tcttgcctga agtcttccgc ccatggtcaa gatggcgccc caatccgcaa 60 agccgtgggg ctagagacgc aggcgagggg aagcagctta agggggcggg gccaaagcta 120 agtctagnna gganngaaga ggaacctcac ttccgttgtc aggtggcggc gccgcagcca 180 tggagggagg cggcggcggc ggcggcggcg gctcgggtgg ctgcgctggg aggcggcggt 240 gagaggctcg cacgcctcca gcccggcccc ggccccccgg gagggagagc cgagcagccc 300 cggctctggg ctacggacta tgggcgaata gctctgacca cccggcgaag agctgagaat 360 gaagcggaga gcatcagaca gaggagctgg ggaaacgtcg gccagggcca aggctctagg 420 aagtgggatt tctggaaata atgcaaagag agctggacca ttcatccttg gtccccgtct 480 gggcaactca ccggtgccaa gcatagtgca gtgtttggcg aggaaagatg gcacggatga 540 cttctatcag ctgaagatcc tgaccctgga ggagaggggg gaccaaggca tagagagcca 600 ggaagagcgg cagggcaaga tgctgctgca caccgagtac tcactgctgt ctctcctgca 660 cacgcaggat ggcgtggtgc accaccacgg cctcttccag gacccacctg tgaaatcgtt 720 gaggacacag aatccagccg g atg gtt aag aag atg aag aag cgc atc tgc 771 Met Val Lys Lys Met Lys Lys Arg Ile Cys 1 5 10 ctc gtc ctg gac tgc ctc tgt gct cat gac ttc agt gat aag acc gct 819 Leu Val Leu Asp Cys Leu Cys Ala His Asp Phe Ser Asp Lys Thr Ala 15 20 25 gac ctc atc aac ctg cag cac tac gtc atc aag gag aag agg ctc agc 867 Asp Leu Ile Asn Leu Gln His Tyr Val Ile Lys Glu Lys Arg Leu Ser 30 35 40 gag agg gag act gtg gta atc ttc tac gac gtg gtc cgc gtg gtg gag 915 Glu Arg Glu Thr Val Val Ile Phe Tyr Asp Val Val Arg Val Val Glu 45 50 55 gcc ctg cac cag aaa aat atc gtg cac aga gac ctg aag ctg ggg aac 963 Ala Leu His Gln Lys Asn Ile Val His Arg Asp Leu Lys Leu Gly Asn 60 65 70 atg gtg ctc aac aag agg aca cat cgg ata acc atc acc aac ttc tgc 1011 Met Val Leu Asn Lys Arg Thr His Arg Ile Thr Ile Thr Asn Phe Cys 75 80 85 90 ctc ggg aag cat ctg gtg agc gag ggg gac ctg ctg aag gac cag aga 1059 Leu Gly Lys His Leu Val Ser Glu Gly Asp Leu Leu Lys Asp Gln Arg 95 100 105 ggg agc cct gcc tac atc agt ccc gac gtg ctc agc ggg aac cgg ggc 1107 Gly Ser Pro Ala Tyr Ile Ser Pro Asp Val Leu Ser Gly Asn Arg Gly 110 115 120 tta cac cag ctc cca tcc ctc caa cat gga gaa ctt ggg gtt gtc tcc 1155 Leu His Gln Leu Pro Ser Leu Gln His Gly Glu Leu Gly Val Val Ser 125 130 135 aag atc ctt cat ccc cac cat gta ttg cgg ccg ctc tag aggatccaag 1204 Lys Ile Leu His Pro His His Val Leu Arg Pro Leu * 140 145 150 cttacgtacg cgtgcatgt 1223 

What is claimed is:
 1. An isolated polynucleotide comprising a nucleotide sequence selected from the group consisting of SEQ ID NO: 1-55, or the mature protein coding portion thereof, the active domain thereof, or the complement thereof.
 2. An isolated polynucleotide encoding a polypeptide with biological activity, said polynucleotide which hybridizes to the polynucleotide of claim 1 under stringent hybridization conditions.
 3. An isolated polynucleotide encoding a polypeptide with biological activity, said polynucleotide having greater than about 90% sequence identity with the polynucleotide of claim
 1. 4. The polynucleotide of claim 1 which is a DNA.
 5. An isolated polynucleotide which comprises the complement of the polynucleotide of claim
 1. 6. A vector comprising the polynucleotide of claim
 1. 7. An expression vector comprising the polynucleotide of claim
 1. 8. A host cell genetically engineered to contain the polynucleotide of claim
 1. 9. A host cell genetically engineered to contain the polynucleotide of claim 1 in operative association with a regulatory sequence that controls expression of the polynucleotide in the host cell.
 10. A composition comprising a polypeptide, wherein the polypeptide is selected from the group consisting of: (a) a polypeptide encoded by any one of the polynucleotide of claim 1; (b) a polypeptide encoded by a polynucleotide hybridizing under stringent conditions with any one of SEQ ID NO: 1-55; and (c) a variant of the protein (a) or (b).
 11. A composition comprising the polypeptide of claim 10 and a carrier.
 12. An antibody directed against the polypeptide of claim
 10. 13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: a) contacting the sample with a compound that binds to and forms a complex with the polynucleotide of claim 1 for a period sufficient to form the complex; and b) detecting the complex, so that if a complex is detected, the polynucleotide of claim 1 is detected.
 14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: a) contacting the sample under stringent hybridization conditions with nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions; b) amplifying a product comprising at least a portion of the polynucleotide of claim 1; and c) detecting said product and thereby the polynucleotide of claim 1 in the sample.
 15. The method of claim 14, wherein the polynucleotide is an RNA molecule that encodes a polypeptide of claim 10, and the method further comprises reverse transcribing an annealed RNA molecule into a cDNA polynucleotide.
 16. A method for detecting the polypeptide of claim 10 in a sample, comprising: a) contacting the sample with a compound that binds to and forms a complex with the polypeptide under conditions and for a period sufficient to form the complex; and b) detecting formation of the complex, so that if a complex formation is detected, the polypeptide of claim 10 is detected.
 17. A method for identifying a compound that binds to the polypeptide of claim 10, comprising: a) contacting the compound with the polypeptide of claim 10 under conditions and for a time sufficient to form a polypeptide/compound complex; and b) detecting the complex, so that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide of claim 10 is identified.
 18. A method for identifying a compound that binds to the polypeptide of claim 10, comprising: a) contacting the compound with the polypeptide of claim 10, in a cell, for a time sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a reporter gene sequence in the cell; and b) detecting the complex by detecting reporter gene sequence expression, so that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide of claim 10 is identified.
 19. A method of producing the polypeptide of claim 10, comprising, a) culturing the host cell of claim 8 for a period of time sufficient to express the polypeptide in said cell; and b) isolating the polypeptide from the cell culture or cells of step (a).
 20. An isolated polypeptide comprising an amino acid selected from the group consisting of any one of the polypeptides from the Sequence Listing, the mature protein portion thereof, or the active domain thereof.
 21. The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide array.
 22. A collection of polynucleotides, wherein the collection comprising the sequence information of at least one of SEQ ID NO: 1-55.
 23. The collection of claim 22, wherein the collection is provided on a nucleic acid array.
 24. The array of claim 23, wherein the array detects full-matches to any one of the polynucleotides in the collection of claim
 22. 25. The array of claim 22, wherein the array detects mismatches to any one of the polynucleotides in the collection of claim
 22. 26. The collection of claim 22, wherein the collection is provided in a computer-readable format.
 27. A method of treatment comprising administering to a mammalian subject in need thereof a therapeutic amount of a composition comprising a polypeptide of claim 10 or 20 and a pharmaceutically acceptable carrier.
 28. A method of treatment comprising administering to a mammalian subject in need thereof a therapeutic amount of a composition comprising an antibody that specifically binds to a polypeptide of claim 10 or 20 and a pharmaceutically acceptable carrier. 